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BI48 SEQ ID N0:1 

ATCGCC6ATCCAAT6CCAAGGA6TACCTGGGCAATCA6AGCCTACTCAC6GCTGCCG6G6CCGGCATTGCCAAGCTCCTG 
GACGCCGACGAGMCAACACCAGTACC6TCTTCA6CGGCAACGGCACCAOTTCGGGACGACCGGAACCAACAGCAACTC 
GGCCCTCAACAGCATCCTCTCCGGCGGCGTCAGCGACATCCGGCAGTGGATGAACAAGTTGTACGGGGAGGCCTTCGCCG 
CCGTCTACGTGCAGCCAGGTGCGCGGGTCGCAGTGCATCTCGATCAGCAACTGGCGATCGACTATGAACTCAAGGGCCGC 
AAGGTCGATTACAGCTCTGGAGCCGCTCATGCAACAGCAGACTTGGACTAACCCCCTTCTTCGTCTCTGCGCCGGCCTGG 
CCTGCGCGCTGACCCTGGCAGCGTGCTCCACCAGCAAGGAGGAGATGCTGCCCCACGGCGAGGCCAACATGCTCGACGTC 
TGGGAGCGAGGTGCGACCAGCTCGATAGGCAACAGCCGTGGCCGGCTGCTCCTCGATGCCAGGCAAACGCTGCGGCGCCC 
AATCGATCCGCAGCAGGATGCCTCCGCGAATGACCAGGCCGACTACACCCGCACGGCCAGCAACGAGATCCACAGTCAGT 
TCAAACGACTGCCCAATCCCGACCTGGTGATGTATGTGTTCCCGCACCTGGCCGGCAGCGATCCCGCCCCGGTACCGGGC 
TACACCACCGTGTTCCCCTTCTACCAGCGAGTCCAGTACGCCATGCCGGGCGAACGCACGGAGGACTATTGATGGGCTTT 
TTTCAAACCCTTCTGCGCGGTCGCACACAGCCTCAGTCGGTACCGGCAGACGCTCCCGAAGATTCAGGAGCGCTGGACGT 
AGCGGCCGCGGAAGAAGCGACTGAGCGCTATCTGGCGCGACTGGCCGCCATGGGTATTCCTCTGCCCAACACCGGGAGCA 
AGAATGGCGCCACGCAGGCCGAAGCGTCACGCCTCTACGATCACGACCCATCGTTCGTAGACCTGCTGCCCTGGGCTGAG 
TACCTGCCCGACGAGCAAGTGATGCTCCTGGAGGATGGGCGTTCGCGCGCCGCATTCTTCGAACTGGTGCCCTTGGGCAC 
CGAGGGCCGCGATCCCAATTGGATGCAGAACGCCCGGGACGCATTGAAAGAAGCCCTGCAGAACTCCTTCGACGAGCACG 
AAACCTCACCCTGGATTGTCCAGTTCTACGCCCAGGACGAGATCAGCTGGGACAATTTCCAGGAGCAGTTGAGGCAGTAC 
GTCCATCCTCGAGCGCGAGGATCGGCCTTCAGCGAGATGTACCTGGCGCTCATGAAGCATCACCTGGAGGGCATTTCGAA 
GCCGGGCGGACTGTTCGTCGACACCGCCGTCAGCAAGCTGCCCTGGCGAGGACAACAGCGCCGCGTGCGGATGGTCGTCT 
ACCGCCGGATCCGCAAGGAGGATGCGCAGATTCGCGGACAGGACCCGGCGGCGTACCTGAAATCCATCTGCGAGCGTATC 
CAAGGCGGCCTGGCGAACGCCGGCATCGTCGCTTCGCGCATGGGCGGACAGGAGATCAGGAACTGGTTGATCCGCTGGTT 
CAACCCGCACCCGGATCACCTCGGCCAGGCCGAGGCGGACCTACGTCGCTTCTACGAACTGGTATGCCGTCCGGACGAAC 
CGATCCTGCAGGATGAATTGCCACTGGCCGACGGCACTGACTTCTCCCAGAACCTGTTCTATCGGCAGCCTGTTTCCGAT 
GCCACCCAGGGCGTATGGCTCTTCGATGCCATGCCGCACCGAGTGATTGTGGTCGACCAGTTGAACAAAGCGCCGCTGAC 
AGGCCACTTCACCGGCGAGACGCTCAAAGGCGATGGCCTCAACGCCCTGTTCGATCGAATGCCCGAGGACACGCTGCTGT 
GCATCACCATGGTCGTGACGCCGCAGGACATGCTGGAAGGGCATCTGCAGCAGCTCTCGAAAAAGGCCGTTGGTGACACC 
CAGGCCTCGATCCACACCCGCGAGGACGTGGCCACCGTTCGACGCCTGATCGGCCGGGAGCACAAGCTCTATCGCGGAGC 
GATCGCTCTGTTCGTGCGCGGCCGCGACCATACCCAGTTGGAGGAACGCTGCATCACCCTGAGCAACGTACTGCTCGGCG 
CCGGCCTGGTGCCGGTCGAACCGCAGAACGAAGTCGGACCGCTGAACAGCTACCTGCGCTGGCTCCCCTCAAACTTCGAT 
CCAAACGAGAAGCGAGCCCTGGAGTGGTACACCCAGATGATGTTCGCTCAGCACATCGCCAACCTGTCGCCCATCTGGGG 
GCGCACCACCGGTACCGGACACCCTGGCTTCACGCTGTTCMCCGTGGCGGCGCGCCGTTGACCTTCGACCCGTTCAACA 
AGCTGGACCGGCAGATGAATGCCCACGGCTTCATCTTCGGGCCAACTGGCTCCGGCAAGTCGGCGTCCCTGACCAACCTC 
ATCTGCCAGATGCTCGCCATGTACCTGCCGCGGATGTTCGTCGCGGAAGCGGGCAACAGCTTCGGCCTGCTGGCCGACTT 
AGCCAAGCGGTTTGGCCTCTCGGTCCACCGGGTGCGCCTCGCCCCGGGCTCCGGCGTCAGCCTGGCGCCGTTCGCGGACG 
CCATCAAGCTGGTCGAGAGCCCCGACCAAGTGAAGGTGCTGGACGCCGAAGACATCGAGGCCTCGGACTCGGTCCAGGGC 
AGCAAGGCCGACCTCGAGGACGACCAGCGAGACATCCTGGGCGAGATGGAGATCGTCGCCCGCCTCATGATTACCGGTGG 
CGAAGAGAAGGAAGATGCGCGCCTGACCCGTGCCGATCGCAGCGCCGTCCGCCAGGCGATCCTGGCGGCGGCCAGGACCT 
GCGCCGCCGCGAACCGCACGGTACTGACCCAAGACGTGCGCGATGCGCTCTACGAGGCCTCCAGGAGCGATAGCACCGCG 
CCAGAACGCCGCGCGCGGATCGCCGAAATGGCGGAAGCCATGCAGATGTTCTGCATGGGCGCCGACGGCGAGATGTTCAA 
TCGCGAAGGCACGCCCTGGCCTGAGGCCGACCTTACCGTGGTGGATTTCGCAACGTACGCGCGCGAAGGCTACGCCGCCC 
AGCTCGGGATCGCCTACATCTCGCTGCTGAACACCGTGAACMCATCGCCGAACGCGACCAGTTCAAGGGCCGGCCAATC 
GTCAAGATCACCGATGAGGGGCACATCATCACCAAGCACCCGCTGCTGCTGCCCTACGCCATGAAGATCACCAAGATGTG 
GCGGAAACTGGGCGCCTGGTTCTGGCTCGCCACCCAGAACATCGACGACATCCCAGCCTCCGGGGCGCCGATGCTGAACA 
TGATCGAGTGGTGGTTGTGCCTGAACATGCCCCCCGACGAAGTAGAGAAGATATCCAGGTTCCGCGAGCTGTCGCCGGCG 
CAGAAGTCGATGATGCTCTCGGCCCGCAAGGAMGCGGCAAGTTCACCGAGGGCGTGCTCCTGGCCAAGGGCAAAGAATA 
CCTCGTCCGTGTGGTTCCCCCGAGTCTCTACCTGGCCCTGGCCATGACCGAAAACGAAGAAAAGAACCAGCGCTACAACA 
TCATGCAAGCCACCGGCTGCGACGAGCTCGAGGCGGCCTTGCAGGTCGCAGCGGATCTCGACAAGGCGCGCGGCCTGCCA 
CCCTTCCCCATTGTTTTCCCAGACCAACCGGCAGTGGAGTGCCAGGACGAATGAGAGTTCTGAATTCGCTGACCCAGAAC 
CTGATCGACAACCTGACCCAGATCCTGCAGAACCCCGAAGAGGATGCCCTGCAGACGCTAAGGATATGCGCTCCTGTACT 
GATAGAGGAGCTGCAGCAGATTCAACTGAGGGCAGTCGATCGCCGGGATATCGTCCCGCAGATAAAGCAGCTCTTGGATG 
AATGGCTGCAACAACATCCACAGCCTGATACGGCCCAACAGGCGCTCATTGAGGCCGTGGACCGCGCGGAGATCCTACAG 
CGGAGGCAAGCGTGAGACTCTTGAAGGGCGGCTGGGCAGCCAAACGATTTCAAGGTCCCGCCCTGCCCTGGGCGGGGCTG 
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CTGCT(MTCTTGCTGGCTGCATCC(raTA^ 

CGATGCGAAAGCGCGCTGGACGATCAATGAATACGCCGACCTGGAGTGCCCCTTCTGCAAGGTCTACACCCCGCGGCTTA 

AGCGCTGGGTAGACAGCCATCCGGACGTGAACCTGGTTTGGCGCCATCTTCCCCTGCAGATGCATGGCGAGGCGGCCCGC 

CACCAGGCTCGCCTGGTGGAGTGCGCGGGGATCCAAGGCGGCGCCAAAGCCTTCTGGAGCGCTATCGATGCGATCTTCGC 

TCAGTCGGCCGGCAACGGGGGCGGGCTGCCTGGCGGCACATTGGACTTTCCTGAACTGGACCAGGCTCGACTGGAGAAAT 

GTGCGAMGACAACGAACTTATTGACTCAGATATCMGTTGG^CATCGA(^TTGCACGGTCGAAGGGCATTACAGCGACC 

CCGACCCTCGTCATCCGGGACAACCAGACGGGACGAAGCGTGAAGCTTGAAGGCATGGCCGACGAGACCACGTTGCTGTC 

GGCGATAGACTGGCTAGCC^(WATCTCTAGCGTC(K:GCCAAGAGACTCTTGGCTAAGAAATCGGCGAGGATTCCAACAT 

CCCTCTTTTGGTCCTCCAGGATGCCCTGCACTTCACCTGGCAGMCCTCGACCTCCTCCCCATCCACAATCTTTACCATT 

CTCTTGTGGCCGGAGCTGGTGAGGCTAAGCCTCAACTCCATTGCCGGCCGAGCATTGATGTAAATGCTCTCGAGCAAGCG 

CTCCATGACTTCGACCACTCCTTAATATCAGTTAGCCAGCTACATACAGGAATTATGCTACCCAGGACATGCAGGCGTCA 

CCCCTACTTATGTACGTGGCAGCGTTCGATCACGGCTCGAAAAAATACACCACCTACGAGTTGATGATTTCCTGCTGCCC 

TATCGGAAGTGATCCTGTCTGCTGTCTGTACCTTTCTAGAACCGGTACAGACCCATGCCTCTTCATCACTCCCCCCCTGG 

CCGGCGGCCACCAACGCTGGCCGTTGGCGTACTACTGGTACTGCTGAGCAGCGCGAGTCAGGCCGAAACCTGGGTCATCA 

CCGACAAGGCTCATCCGGTCTCTGCCACCGGATCGTCGCGCGTTCTGTTTCTGGACGCCCAGGAACACCTCGAGGAGCAA 

CTGACTGCGGCCTTGCCCCAGGATCCACAGCATGCTCAAGCGGCGTTTAAGCGATTGCTACAAAGCCCCGATGGGCGCCG 

CCTGCAGGCAGAGCTGGTCAAGGCACAACAAGACGTCGCCGATGCGTGGAGTCTCGGTGTCGAGAAGATCCCTGCCGTAG 

TAGTCGATAGGCAGTACGTGGTCTACGGCGAACCGGATGTTTCGCGCGCTCTTGAGCTAATCGCCAAGGCCAGGAGGTCG 

CGCTGATGACCAGCCTCAACCTCCGCCGCCTGGCAGCGGCGGCCGCCACCTTCAGCCTCTCGTTCACGGCCTCGGCCGCG 

ATCAACAGCGCTGCCATCGTCTCCTCCACCCTTTCCCCTCAGTGCCTCGMTACMGGTCGTCGGGATCTGTTACTGGCT 

GCTCTGCGGCCCGCATGGCTGCAAAGTGAAGACGTCGGTCAAGGTCCGCCACTACGTGCCTGACGCAGTCGTCTCCAGCT 

ACGCGAATACCGGGAGCAACCCCTGGACCGAGGTATCGGCGCTGGGTACACCGAATCCACTCGCCCAGGCCGGCAATGAC 

GCGACCACMACTACMGGCCGAGAACAGCATCGGCCGCTTCAAGGAAGCGGATGTGATCGGCCATCCTGGTGGCGCCAC 

GTTCAGCCGGTTCGCCAGCGCCTCTGGGTACGTTTGCCCTGGCGCCACCGTCCCGCTGGTGCCGTACTTTCTCAGCACAC 

TGGACGCCATTGGCTGGCGGCATGGAATTCCCGAGCAGGTGTACCCCGAAGCGTTGGTCCCAGGGCTGCGCGAGGTGGGT 

GGAATCTTCTCCGGCGACATGTGGGGGAACCTCTATCCGCGCAGCGGCTTCCTGCACCAGACCGACGACTACAAGACGGC 

AGCCGTCATCGCCCAGCGCGCCGGCGATATCACCACGCGAATCGGCCAGCTCCACGTCTACCTCCCCATGCGCGCAGCCC 

CCAAGGACGGCTACTG(K:CGGCGGGCGAGCTGAAA(3AGGGC(^TGCCTCGACCGGGAAATGGCAGGAGCTGACCCCATC 

CTGAGCCTCAACTGCGCGGTGTTTCCCAACTCTGGGCCGAAGACGCAAGCCGTCGACGGGGAGCACGCCTGGGCGCTCTG 

GCGTCCCTACTCCTGCTGCCAGCGCAAGGGGCAGATGTTCATCTGCAGTACCGACTTCCAATAAGGACACGGAGACGAAT 

CATGCGAATGAACATCACCTCGGTCGCGCTAATGTGGCTGCTCGCAGCGCAACTTGCCCAGGCCGACGACCCGATCAACG 

TGTCCAAGACCGGCACGGTGCTCAGCGACGAGGTCCTCTACAGCATTGGCGGCGGCAGTGCGGTGAGCATGGGCAGCGCC 

GGCCAGATGGACTCGATCGGCGTCGGCTTCGGCTGGAACAACGACATGATGTGCGGAAACATGAACCTGAGQCCACCCT 

GGAGAACCAGCTCAACGGTGCCACACAGGGnTCCAGMCATCATGGGCTCAGTCATCCAGAACGCGACCGGCGCGGTCA 

TGTCGCTGCCGGCGTTGATCATCCAGCGCGCGAACCCTCAGCTCTACAACCTGATCACCAATGGCATCCTGCAGGCGCGG 

ATCGACTACGACCGCTCGAAAGGGACTTGCAAAACGATCGCCGAAAAGATGGCTGACATCGCTGGCGAGCAGACCGGCTG 

GGGGAAAATCGCCGAAGGCCAAGCCCTGGGCGCCACACTGGCCTCTGACGGGAAAGACGCCGTATCCGCCCTCGAAGCAG 

TGGAGAAGAM(MCGGCAACGATGGCGTMCCTGGGTTGGTGGAGACMGGCCG(K:GGCTCC(MC(^GMGCC 

ATCGTCAACGACGTGACCCGGGCGGGCTACAACCTGTTGACCAGCCGCTCGGTGAATGATTCGTCGAGCGTGCCTTCCGC 

CACTTGCAACAACGGCCTGGTCTGCAACACTTGGTCCTCCCCCCAGGAGGCCGCCGCATTCGCCACCCGGGTACTGGGGG 

AGCAACAGCAACAGACCTGCGAAGGCTGCCAGAAGACGGTGACGGCTGCTGGCGTCGGCCTCACCCCGCTGATCCAGGAG 

ACCTACGACAAGAAGCTCCAGTCGCTGCAGGAGCTGCTGTCGAAGAGCAAACCACTGACTGCAGAGAACCTGGCTGCGGC 

CGGCACCGATGCTCTGCCAATTACCCGCGGCGTCATCGAGGCGCTGCGCGACGAGCGTGACCAGGACGTCCTGGCGCGCC 

GCCTGGCGTCCGATGTCTCCCTGATGGACGTGCTCAGCAAGGCACTGCTACTGCAGCGCCTGATGTTCGCCGGCGCCAAG 

GAGCCCAACGTCGCCGCCAACGGCCTGGCCACCCAAGCCGTCGATCAGCAGACCAGCCTCCTGCAGCAGGAGATCTCCAA 

TCTCAAGACCGAACTGGAACTCCGTCGCGAGTTGGCCAGCAACTCCCCCATGCGGGTCATCGAGCGCGGGCAACAACGCG 

CCTCAGGGTCCAGTGGCGTGTTCGAGTCGGCGCCCGATGCCGATCGCCTCGATCGCCTGCAGGCCCCCTCTGCCGCCGGC 

GGCAAGTCGGGAGGGAGACCGTGATGGCAGATACGCTCACCACCCGAAAGCTTCTCGGTCAGCTACTGGTCGGAGTGCTG 

ATCGTCATCGGACTGGCAGTGGTCGGTACGCTGCTCAGTCTCTTCGCCCTGAACCACTTCGGTGGCATCCAGGGCCTGGA 

GGCCTGGCGGCAAAGCAACTACTGGAGCTTGTTCGCCTGGCGGGCGCTGCTGTACTGCGCCCTGGCCATCGCCTGGTTCC 

GGCAGCGCAAGGAACTGAGCGCGCATGAGCGGCAGCGCATTCGGCGGATCGAGATCCTGGTGCTGTTGCTGGTCCTGCTC 
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ATCGAATTCAGCAAA6CCTACTTCC6CACG6GA6GCGCAGCAT6ACCTTCATGACCAATGACTACCTGGA6TATTACCTC 

ACCCTCCTCGGCTGGATCATCAACAACGGGATCTGGAACATGATCTCGGATACTGGCCTGTTCGCGGTGCCGTTCGCGGC 

CATCGTGATGCGCGAATGGCTGAAAGTTCGTGGGGAAGGCGCCGACGAGGGCAACAAGGGAGTGCTGTCTCTCGCCCGCA 

TCGAGACGCATATCTACGTCGGCTACATCGTGGTCGCCCTGGCGGGGATCCCGGTCGTCAACGTGAGCTTCGACACCATC 

GAGTTCGACCAGACTCGCGCCCAGCAGTGCCAATACAATCTGCCGGCACCGGCGGACACCGGCTGGTCGAGCTCCTTCAG 

CAGCCTGGCCG^O^GAGTGCGCAGATGCCGCTCTGGTGGGCGATGATGCACGCCCTGTCCAAGGGCTTCACCAGCGGCG 

CCATCGCGGCCATTCCGTGCGGCACGGATCTGCGGCAGATGCGAATGGAAGTGGACAACACGCGCGTGAACAATCCGCTG 

CTGGCACAAGAAATCGCTGATTTTTCCAGAGACTGCTACGGGCCTTCCCGTGCGCGGCTGTTCATGCGGCAACCCGACCT 

GGGCTCCGTCGCCGAGGACAACAAGGCGTTGCAAGACCTGAACTGGATCGGCTCCCGATTCTTGTTGAACACCCCGGGGT 

ACTACGACACCGACTACTCGAAGAGTCCCCGTCAGTCGTGGCCCTACAACGCCACCCGCGATGCCGGCCTGCCTCAGGTG 

GGCGGTGGTGGCGGCTACCCAACCTGCAAGCAGTGGTGGGCTGACTCAGGGATCGGCTTGCGTGATCGGATCAAGGACCA 

GGTGGATCCGGACCTGATGACCAGCTTCCTCAAGTGGGCGAAATGGTTGAACCAGGACGAGGTGACCGAGGCTGTCATTC 

GCCAGGTGATCTCACCCTCCA(K:CAGGTCMGGGTAACGTCTACACCGATTACGGC(MGCAGGTGGGCGG(^CCGT 

AACGGCATCGCGAGAACCGCAGGAACCTTCGGCGTTGCGGTGGGCAGCTTGGCATACTTCCCGGCGATGGATATGGTCCG 

CCAGGCACTGCCGATGGTGATGTCGTTCCTGAAGATGGCAATGGTCATCTGCATTCCGATGGTCCTGGTCATCGGCACCT 

ATCAACTGAAAGTTGCCATGACGATGACGGTCGTCTTCTTTGCGATGATGTTCGTCGACTTCTGGTTTCAGTTAGCCAGA 

TATATCGACAGCACGATACTTGATGCTTTCTATGGTTCGGGATCACCACATCTTTCATTCAACCCAGTCATGGGGCTGAA 

TACGGCTACTCAAGATGCGATCTTGAACTTCGTTATGGGTTCTATGTTCATTGTTTTACCACTACTGTGGATGACAGCGA 

TCGGCTGGTCCGGAATTCAAGCAGGGTCTGTTCTGAACGGATTGAGCAGAGGGACTGAAGGAGTTCAAGCCGCCGGCAAG 

GAAGCAGGAAATAGAGTTAAAAACGCAGTTTGAGGAAGTATAAAGCCATTACCGGGCTCTAGTCCCGGTAATGGTGTGCT 

GCCTGATAGTCAAACAGCTCTGCTACAGATTCAGTTTCAGCTTGTGAAGTCATAGTTGAGACATCTCGACTTGGAGACCC 

TTTAGAGACGGAGATGTCCCCTTTAACCAAGGCAAATCCCACCAGAAAGACACACAGTAGAGTCATTGATACCCCAGCAA 

GCCACATAAAATTCTGTGACACTACAAGAGCTACTAACAAGAATAGGCTAACTCGCTTAACCCAACGTAGAGCTGGACGG 

CGATCATGCAGGCAGAAACGCACAAGCATACCCAGACCAAAACCGATCCGGGAGGCAAAGCCTTTGTTGGTGTGCGCGTT 

CATCATCAATCTCCTGGCTCCCAAAGGGAGGCATCCTGCTATCACCTATACGCCGAAAAAGATGATTTGGCAAGCATTAT 

GGCATATTATGCCACTAGCTATCTGCCGACTGGAGTACCTCATGGCAACGCGAAACGTCGTCCTTCCCGATCCGCTGGAG 

CAGGATATCAACGAGCTGGTGGAGACCGGCCGCTATCAGAATCGCAGCGAAGTCATCCGGGCAGGCTTGCGCCTGCTGCT 

GCAACAGGAAGCCCAGATANGCGCCAAGCTCGAAACCCTCCGCAACGCAACATCCAGTGGGCTGATGCAACTGGAGCGCG 

GCGAGTACGACGAGATCACCAGCGACGAACTGGCCCAATACCTCGACGAGCTCGGCAACCAGGCGAGCCACTGAAGCATG 

GCCAAGTACCGCATCTCTCATGATGCCCMGCGGACATCGTCGATATCCTGCGCHCACCCACAACCACTTCGGCGATGC 

CGCGCGCCGACGTTACCAGGCACTCATAGGGGCGGCGCTGGAAGCAGTTGCGACAGACCCACAACAGGTAGGCAGCATCA 

GCCGTGAAGAACTGGGAGCTGGCCTGCGCAGCATCCACCTCGTTTACTGCCACTCGATGCCCAATGTCGGTAAGGTTGTT 

CGGCCCAGGCACTTCGTCTTCTACCGGGTGGCGACAGACCAGGTGCTAGAGGTGGTTCGCGTGCTTCACGACGCCATGGA 

TGTGGATCAACACCTGCCCCAACGATGAGCAACTGTAACAGGAGCAAATGGCCCAAAGGGGAAATGGGCTTGAAGGGCGA 

AGGGAATGGTTCACAAGGGTAAAAGCCCTACCCGAAAGGGCCTGCCGGAAGGCAGAAAGGGCTGCGTTCGCGCCGGCGAA 

CACGAAAAGGCTACGGGTGAACATCCACCAGCAGGTGCCCCATGGTTGATGCGCAATAGCAGTGGCTTGACCACTCGATT 

GCATTCCCCCTGACCAAGCATTTGCATTCGTGCTGACCACCGCTTGCATGCGGCTTTACCAGTCACACCGTAGTCACGAC 

CGGCTGCAGTCGACCCAAAGCGGACCTTTGTGACAGACCGAAATCGGCCAAAAGCAGACGCTCAAAAGCATCTACGAAAG 

CAGGGACAATTCCGGGGGATACTCCTCTACGTACGTTGCTTATATCGCGGAGGAATTGGCTGAACCGCGGTGCGGAGAGA 

TCTCCTCAAACTGATGGGTTGCACGCATATCGAAGCAGATTACATAGGAGGCTTGCGCTGTTCAACAGCTCCTGAGGGGA 

CTTGGGTTGCCCATGGTTTCCACGGCCCAATCGTTGACGTCATTGACGATTCCGCTGGCTTTTTCAGTACGCATCGCTTG 

GCGCTCCATTACCCAGCCCAATGCGGCCTTGCCGTTGACCAAGCGATTCCAAGGACTGCGATCCATGTAGCCAGCCCTCT 

AATGCATGTATGTATAGGTAAGGTCGTCGTTATTTCGGCGTGGATGTGCTGACTGGAGGTTTAGCGTGGTGAGTCAGTAT 

GTCGTGCGCTATATAAGTCTCATCTCGGCATCCGCACATGGATGCTGTTAGATTCAGCAAGTGGGAAAAAGCGCGAGCGC 

TAAAAGGAAAGCTGCCGGCCGGCGTGCTCAGTAGCAAGCTACCCCGGCAACACTTAGGAAGAGGTGATCATGAAGTTACA 

GGCATATCGGCTGCAGAACTACCGCCGGCTGCGCGATGTTGTCATCGAGCTCGATGACGAAATTTCTATCTTTGTCGGTG 

CCAACAACAGCGGGAAGACATCCGCCGTCCAAGGCCTGTACTCAATGCTTCGCGGCGAAGTGAAGAAGTTCGAGCTCTTT 

GACTTCAGTGCGGCGCTGTGGGCCGAGATCGATGCGGTCGGCAGGACGCCCCCTGGCGATGAGGATGCGCCCAAAAGGTT 

ACCGTCCATACTCTTGGATCTCTGGTTCCGCGTCGGTGAAGACGACCTCGCCACTGCGATGTCGCTGCTGCCGAGCACTG 

AGTGGGACGGCAAGTGCGTCGGGATCCGGGTAGCGTTCGAGCCTCGGGATGCCCACGAGCTCGTCTGGAAGTTCCATGAA 

CTACATGAGAAGGCCAACAACGCAGCTGTCGCGCTTGCGGCCAAGCGCAAGGCCGCCGGGGAGCAAGCTGTGGAGGCGGG 
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C6CG6AAGAC6CGGCTGCGGT6GTGGCC6ATGCCG6C6A6TACAAGCCTT6GCCA6AAAGCCT6AC6AAGTACCTCACAA 
AGGAACTGAGCAAGGAATACACCTTCCGCTACTACGTGCTCGATGAGCGGGCTTTTGTCGGCTATCAGGCAAGGGAGGCC 
GACTACGAGCCGCTACCCCTAGGCAAGGAGCCGGGCGGTGCAGCCATTCTCAAGTCGCTGGTGAGGGTCGACTTCCTGCG 
CGCGCAGCGGCACCTCGATGACCCAGATGCCGGTAGCTCTGATCGCGCAGAGAGCTTGTCGCGGCGTCTGAGCAGGTTCT 
ATCACCGCAACCTGGAGAAGCGTGGCGACGACCATGCGGCTCTCAAGGCGCTAGATACCTCGGAGAAGGAGCTGAACTTC 
QCCTGAAGGAAGTCTTCAATGACACCCTCACGCGCCTGGCCAAGCTCGGCTATCCGGGCGTCAACAATCCGGAGATCGT 
GATTCGGGCGGCCTTGGATCCGACCACTGTCTTGGGGCAAGACGCCAAGGTTCACTACGTGATCCCGGGCGTAGCTTCCG 
CCCAACTGCCAGACAGCTACAATGGCCTGGGGTTCAAGAATCTGGTCTACATGGTGGTTGAGCTGCTCGACTTGCACGAG 
CAGTGGAAAGCCGAGGATGACAAGCGAGCTCCGCTTCATTTGGTCTTCATTGAGGAGCCTGAGGCGCATCTGCACGCGCA 
GATCCAGCAGGTCTTCATCAGGAACGTHTGCGCCTCCTTGAGGATGCTAACGATCACGCGACTTTGTTCCACACGCAGC 
TCGTCATCACCACGCACTCCCCGCACATCCTCTATGAACGCGGATTCTCGCCCATTCGGTACTTCCGCCGCGTCAACGAC 
CAGTTGGGCCATCACACGGATGTGCGCAATCTGTCGCTATTCAAAACGGGCGCGTCCGACGCTCCAGCGCGCGAATTCCT 
GCAGCGGTATCTGAAGCTGACGCACTGCGATCTCTTTTTTTCCGACGCGGTGATATTGGTGGAAGGCAACGTCGAGCGTC 
TGCTCCTGCCTGCAATGATCGAGTTGGTGGCCAAGCGCCTGCGTTCTTCCGCCCTAACCATCCTTGAAGTCGGTGGTGCG 
TTCGCGCATCGGTTCCAGGAGCTGATCGCCTTCGTTGGGCTCACAACACTGGTCATCACGGATCTGGACAGCGTGACGGT 
CAAGACGGACGCCGAGAAGGCCGCCGCGCAAGGCGCAGGCGCTGAGGGCGCCGTTGACGGAGATGACGAGGACGAGGACG 
ACGACCTGAAGCCCTTCGAGCTTGAAGACGACGACGAAGCAGAACCGAGTGGCAAGAAGAAGTCCAAGAAGCGTGGCAGC 
ACCTGCCATGCACACGTGGAAGGTGCCGTCACGTCCAACCAAACCCTCATCAGCTGGATCCCGAAGAAGCGGTCGATGGC 
AGAGCTCTGGGAAGTCACGGCGGAGCAAAAGACGCTGTCGCTGGCTGAGGATTCCAGCGCTGGGGTTCGGGTAGCTTACC 
AGACCAAGGTTTCGGTGACGGTGGGTGCGACGACATCACAGCTCTGCGGCCGCACACTTGAGGAGGCCTTTGGTCTTGAG 
AACGCGGACTGGTGCCAGGCTGAGGCAAACCGGTCGGTCGGCCTCAAGCTCAAGCGCGCACCGAGCAGCCCTGAAGAGCT 
GGCTGAGAAGTTACACGATAGGGTGGTCGGCAAGAACTTCGACAAGACCCGCTTTGCGCTGGAGGTACTCGCAAGCGGGC 
CGCTCAATGGCTGGAAGGTTCCCGCGTACATCGCCGAGGGCTTGGCCTGGCTCGAAGCCAAAGTGGCCCACGAGCTTGAG 
GCGGATGCTGCCATCGCCACCGAGGTCGCGACTATTGAGCCGACTACAGCCGATGTTGTCGCTATCATTGTTGACCCGGG 
GCAGACGGCATGAGCAGACGAATTGATAGCCCAGATACCGACGCCGACCGCGAGATCCACGCATGCATTGTAGCGACGCC 
TCCGCAGCCCTTCGTGGTTCGTGCTGGCGCAGGTTCCGGCAAGACCACCTCCCTCATCAAGGCGCTGGACTGGGTGATCT 
CGGAGCACGGCGCCAGCATGCGGGCGAGGAAGCAGATAGTCGCGTGCATCACGTATACCGACCTTGCCACCAATGAAATC 
CTGGCGGACGTCAACGATGACCCGCTGGTTCATGTCTCGACCATCCACAGCTTTTACTGGTCTATTGCAAAGACGTTCCA 
GGCCGACATCAAGGTTTGGCTGCAGAACGACATCCGCAGGCGGATCTCCGAACTTGAAGAAGAGTTCGAGAATTACAGCT 
CGCGTGTCCGGCAGACCACGCGCGACAGGAACAAGGCCGACCAAGAGCGATATGTCCGAAGCCTGGAGGCTGTGGCCGGC 
GTCAGGACGTTCAACTACGGCGTGGGCAGTGACTACGCCAAGGGCATACTTGGCCACGAGGACATCCTTCAGCTCGCCGA 
CTTCCTGCTACAAAACCGCCCGCTGTTCCGACGGGTCGTGGCGCTGAGCTACCCGTTCGTGTTTATCGATGAGAGTCAGG 
ACACGTTCCCGGGTGTAGTGAAGTCTTTCAAGGAAGTGGAAGCCCAGATGCAGGGCAAGTTCTGCCTTGGTTTTTTCGGC 
(^CCCGATGCAGTCGATCTTCATGAGAGGCGCAGG(MACATCCA(^TTGAGGATCATTG«:GGGC(^TCACGAAGCCGGA 
GAACTTTCGCTGCGCCAAGCAGATCCTTGACGTCGCCAATGCCGTGCGCGCGCAGGGCGATGGCATGGAGCAAGTCCGCG 
GGCTGCACGAGAGGGTCGATGGGAACCTCAAGCTGGTGGAGGGGTCGGCCCGGATGTTCGTCTTGCCGAACACGCTGAAC 
CGAACCGAGGCTTTGGCAAGAGTCCGAGCGTGGAGCTCGGCGACGAACAACGACGAGGGTTGGACAACCCCAGACATCGC 
AGTCAAGATTCTTGTCATCGTGCACCGCATGGCCGCAAACCGGCTTGGCTTCGGCGGCATCTACTCGGCGCTGAACGACA 
AGACGTCGGATGCCATGAAGCAAGGGATGCAGGACGGCACCGGTTGGCCCGTTCGACCCTTCCTAAGTTTTGCGCTACCG 
ATCGTTGCAGCTGTGAAGGCCGGCAATGAGTTCGCGGCGATGAGCCTGCTCCGGGAATTCAGCCCGCGCCTGGCGCCTGC 
GGCTCTGACCGGCCGACGTGCCGCGGATGTATTGCGAGAGCTGCACGCTGCTGCGTCGAGGCTTGTCGCCATGCTGGACG 
AGGCAGGGACCACCATTGGTGACATAGCTCTCCATCTCTGTGACACGGGTCTTTTTGAGTTCGACGAGCGCTATGCGCGT 
GTTCTTGGGTTTGTCAGGGATATTGCTGACACCGCTCAGGAGCCCGAGGCTGCTGATGCAGTTCCGGCCGAAGGATTATC 
CTTGGACGCGACAATGGCCAAGTTCTTCAATTGCTCTGCGCAAGAGCTTTGGCCCTATGAACGCTATGTCTCAGAAGGCT 
CCCCCTATGCCACGCAGCACGGCGTGAAGGGAGCGCAGTTCGAACGCGTCATGGTGGTGATGGACGAGGAAGAAAGCGAC 
TACCGAACGTACAACTACGAGCGTGTCTTCGCGAGTGCTGAGGCCCGCGCTGCAGATCGTGCACGAGCACTAGACGGTGA 
TGAAAACACTTGGAGCCGAACGCTGCGACTGCTTTACGTCTGCTGCACTCGTGCCCAGCGGGGGCTGGTACTAGCGTTCT 
TTGTCGCCGACCCTGCGACCACCCTGGAAAACGTCGTGGCGAGCGGGATCTTGCCGCGAAGCGCAGTCTTTACGCAGGAA 
GTGTTAGTTGGATGGCCATAGACGAACGATTGCCTGACCTATTTGAAGCCAGAGCCGAGCGGCAGGCTCGGTAGCTAGCG 
GGTGGTAACTCTGAGGACGTAATCGACTGCAATTGGCCGATTTCTGCCTATGACGAGGGGCAGCTACCGACCCATTGCTA 
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TCCTTC6TAA66GTG6CTACGCACC666CACGCTGCGG6ATCG6CTGAAGCTAAAAAATCGCACAACAGCTTT6TAAAAC 
AAGATCTCGGTCGATAACATCTATGAACGGTCTGGGTTGGCAATCAGTATTGGCTGCCAGTTCTTGTGACGATGCATGGT 
TTAAATCGACTCTTTGAGGAAATGCGTGGGAAGCCTGGAAAAGTGCTGTTTCGCCTGCAAAGAAATAATTCATGTTCATG 
CGATTCGTTGTCGGCAGTGCGGCGAGTCCCAAGGCTGGCGAAGGTTCATGAGCTCTCCAACCTCAGTAGTTGCGTTGGTC 
CTTAGCCTTTTATCAATCGCTGCCACAAAACCTGTGGAGCGATTGTTCGATGCCCAGCGAGCAGAGCTACAAATCTCCAT 
CACGGGTGGTGATTACAAAGCTGCCCAGCTTATGTTGACCAATAACGGGTCAAAGCCTGCAACTTTAGTTTCCTTCGAAA 
TCACATCGAAAGCCACGACCAATACGAAAACATGGTTTTTGGTAAGCAATACGGATGGCGAAATTCTGGAGCCAGGCAAA 
ACTTACAAAATCAGGGCCTCAACCGATGAGTCTATCCCAAAAATTGTCGAAGCTGAGCGTCGGACGATTTTGAAGTCTCA 
GTACGCACTTGCAGATAATTGCGAATTAACCGCTAAATACATAGAGGCCACGGGGCAGAAGGTTGTGCGTGTGCAACCGT 
TCATGTGCGACACACCTCCTGAAAAGGGTGGCCTGCCCCCTGGTAAACCTGGCATACCCATTTGGTACCTTGGTCAAGAA 
TGATGTTTTTATGCCGCCCTGGGCTTTGACGCCGATTAAGCAAAGCTGTGTTCGCTCATCCAATACGTCCCTCGCCCAGT 
TAAACGACTGTTATGTATATGGGTGCTGCCGCTACGTAATACCTTGGCCCTACGCATACGAAGTTAATTCTGAAAGCGTT 
CAATGGACAATCTTCCTCCTCGGCGTCGACTGCAGCGGTAAGGTGATCTACTTTCGAAACACTGCAAGGGTAGGTCCTTT 
TTTGGCAGCGTCCATATACCGACCGTGGTATGGCTCAGATGCGCTGGTACTGCATTTCACCAAATAAAGTTGTGCTATAT 
CGCTCACGGCCGGTATTTCTATCGTCCAAGGCGACATATTGACGATTTCAAGGGTTACGTACCCGTCGAACCCCGGCTCT 
ACCTGACCGTaTTAaGGTTACCTGCACGAACMTCGCG(^(MGTTCCTTTGGTTTGCACCAGACCAmTAATCTM 
CGGTATTTTGTATCGATGTTTACTGCAGGTGATGACTTGCTCACCGGATTTCAGCACCAAATTTTGCTGAATTTCTTTTG 
GTTCCAAAAAAAACGTCGACGGGGCTGGATGCGACCCATATACAATGCTTGCAGAGGCGGGCAAAGATTTTGGCTCGTAA 
TATGTATGCCCCMTTGMTCTTGAGAGAAAACTCGTCAAATmGTTTTTTCACATATAGCGCATTGCGCAACAAGATG 
TTCTAGATTTTTGTCAATGACAATCATTTTGGCTTAACTTCGTTCGGCGATGGCTGAGCGTTTTGTTGTGACTGAGGAGG 
CTGCGACTGGTGTTCCAGCACAACCTTGTAATAGACGGCGTACATTATAAGAGAAGACACCAGGGAACTTCCGATTAAAT 
TCAGCGTCCACCCGACCCATATGGTGGGATTGAGCGAAAACCACTTCTTCAATCGCGCGCGCCGCGTGATGGGAAGCGTC 
GATATTTCACTGCCACGCTTGATGGGGCTTTGGGTGTGCGTCGATTCTGTGGAGAAAAAAATCACAGAGCCAAGCTTCTC 
GCCTTTGGTAAGGCAGACATTTTTGTTGGTGGTGTTGAATATCCTGAGCTTGAGCTTGCCATCAAATGCAGGTTCGACCT 
TCGCCGAAGCAACCAGCACGCCGCGTGAGAGAAAAAGACTTCCCGTAGGTAGGACTATGCCGTACCGATTGTGCGGCACC 
TTGATTTCTTCAGCGGCCTCCACCACAACAGAATCGTGACCCCTGATCGTCATACCATCGACAATGCGCCACAGGTTTTT 
GTCGTTGCQGAATAGTTATCACTCCACCCTTCACCCACGGACAATTCGAGGGAAAACTCTTCAATTCTCTCGTCCCCTG 
CATCAGTGAGTATAAGGCTGTTGCTGGAGTACGATGCCGCGTCAAAATCCGTGTGGGATTTCGTTGTACGCCCTTTGATC 
TGTAACACACTCATCAGATTTGTCCTTCGTCCTTGGGCTGCGAGACGCGGCCACCCTGCCATTGTCCTTTATACCGGCCG 
ATATCCCCGGATACCGCCTGAAAGATGACGTGCGCAAAGCGTGCACCAATCTGAATTTCAAACGCCTCGCTGTGATTGTT 
GGTGAGCGCGAACGTCATCGGCCCTACATAACCTGGAGGCAGCACACTGGAACTGAACGTTATCCCGCTTCTGAACAGCG 
TGCTTCTCGGAAAAAAAAGCGCCGCCAGATCTTCCGGCAGATCGAATTCTTCCATGGTGCTCGCCAGATAAGTCTTGCCC 
GGTTCCATGACGAAGCAGTQTCCGGGTCTGCGAGCACGACCTCGCTGGCAGGGGTGCGTCGCGTAGATTCTCGCAAGCT 
TCCACCCCCTACTGTCAGGCGAGAGAGGCCTGCGAGTCTC 

GGGCAAGGTGCTTGATTAGTTTTCCATCCCGGACCAGTTTCAGGAGCGAGTGCGGTGAGTAAATCATCTATTTTGCCTCG 
GGAAGGGCTCAGTCTATAGCACTGAGCAACCGCCTACGTTTACAGGCAAAAAGTCGGACTCAGTCGTTGTGGCCGCATTT 
TGCGTGTCCTGCGGGTTTAGTAGCMGGACmACCGTGCGACATGCGCAGCGCACTnCACTGCACGGACAGCCCCATC 
GGCATAGCAAGCCTTCACCGCCACTCTGATTGGGCTCACGTCCGACAAGAGTCGAGCAGCTTTGCAGGATTTTCCACATC 
ACCAAAACTGCCAATGGCAGCTAATGGCCGTTCTCTGCCTGTCGCCTCTTTGGCATGACTGGTCAAGTCGGATGCAAACG 
GTGGTCAGCACCAATGCAATTGGGTGGTCATGTGCGATGCAATTACGCAGTTGAGCCTGGCCCAGTTCCTCCCAAGCAAA 
GCATAAGACCAAGATGGCACATTGCCAACAAAATACCCTTCCCCGCTACCGTTGTTTTATCGTTGTTGCCAGCCCTGATC 
TGGCGGAAAAGCCCGCTCCATGAATCGTCATGGAGCCTCCCATGTTTCAACTCCTTTCCTGGATATCCAGGAAGCCGTCC 
CCCACCCCAACAACCAAAGCTGCCCCAGGGGGATTCATCCTTCCTCTGAGCAGCATGGAACTGCTCGGCACGCCTCGCCG 
CCGGCAGCTACTGGAGAACATCTGGCAGCGCGCCTCGCTATCCAAGCAGCAATTCGAGGAGATCTACCGGCGGCCACTGG 
CCAACTATGCCGAGCTGGTCCAGCAGCTCCCTGCTTCGGAAAATCATCACCATGCCCATCCAGGCGGGATGATCGATCAC 
GGCCTGGAGATCGTGGCCTACGCACTCAAGGTACGGCAGACCTACCTGCTCCCGATCGGCGCAGCGCCGGAGTCACAGTC 
AGCCCAGGCTGAAGCCTGGTCGGCCGCCGCGGCGTATGGCGCCCTGGCTCATGACATAGGCAAGATCGTCGTCGACCTGC 
AGGTTGAGCTACAGGACGGCAGCACCTGGCACCCTTGGAACGGACCGATCAACCAGCCATACCGCTTCAAGTACGTGAAG 
TCCCGCGAATACCAGCTCCACGGCGCTGCCTCAGCACTTCTCATCCACCAACTGCTACCGCGCACTGCACTCGATTGGCT 
CAGTCGCTTTCCAGAGCTGTGGGCTCAATTGATCTACCTGTTCGCTGGGCAGTACGAGCACGCCGGGATCCTCGGCGAGA 
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TCATC6TGAAG6CAGACCA6GCCTCAGTTGCACAGGAGCTAGGAGGCAATCCG6ATCGAGCTCTGGCT6CACC6AAGCA6 
TCGCTGCAGCGGCAGTTGGCAGACGGCCTTCGCTTCTTGGTGAAGGACAAGTTCAAGTTGAATCAACCTAGCGGCCCGTC 
TGATGGAT(MCTGACCCAGGACGCACTCTGGCTGGTGAGCAAGCCTGCT(K;CGATCAACTGAGAGCCTACCTGCTGGCCC 
AGGGTATCGATGGGGTGCCCTCCTCTAACGCGCCGTTCTTCAGCATGCTCCAGGACCAAGCCGTCATCCAGACAAATGCC 
GAGGACAAGGCCATTTGGACGGCCACGGTAGACAACGGTGCTGGATGGAGAAACAAGTTCACGCTACTCAAGATTGCTCC 
AGCCTTGATCTGGACAGATGCTGCCGAGCGCCCCTCACCCTACAGCGGATCACTGGTCGTTGAAGATGGAACCGCCTCAA 
CGGAAAAGCCGGAAACGACCTGTGAAATTCCCAACGGGCCGGCTGAACAGCAGCAAGCACCAGAAACGAAGATGATGCTC 
CATCAACCTGCGCCGAGCGTTGCGAAACCGGCAAACGAGACGCAGGCGATTGCGAAACCCTCAACTGATGATCAAGAAGA 
AACAGACGATTTGTATGCACTTCTTGGTAATATCAATTCGCCACTAGAAGAGCTAGACACTAGCCACGACTCGCCGGCTG 
CCTCTCCTACGAACACACGCGGGGAGGAGAACCTACAGCAGCCACTAGGGACCAAGGAGCCAACAGATTGCGCTCCTGAA 
GCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAAATCTGGCATCGC 
GGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGCATACCGTAGACGGGACCGCCATGCTGGTCACGCCAGGAA 
TTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGACCGGCTGGAAGCTGGTG 
CAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACCATCAAGGTTTCTGGTCC 
TCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCAGCCTCTGGACAACCCAA 
GCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGACGCCGCAGCAGCTCACCGAGGAGTACATCTTCGCGCACG 
ATCTCCGGGAAGCCAGCGCGAAGATCTACCGCGCCGCGACCAAGGCGCTGCTCAAGCACTTCGGTCCTACGGCAACCGTA 
CAGGACGTGGACCACCGGGCTGTCCTGGGATGGCGACG(^AGGTACTGGAACAAGGCCTGTCCAAGCGGAGCTGGAACAC 
GTACTCGAATCATCTGCGGACGATCTGGGGCTATGCCATCGAGCATGAGTTGGTGACGCACTCCCAAGTCAACCCGTTCA 
GAAAGACCACCGTCATTCCCCCCAGGCGAGCAAGCAAAACCGTCGCCGCCGAAGCCATCCTGCTCGCCCGCAATTGGCTC 
AACATGCAGGACGGCGCAGAGCGCTGCACCGGCGAACGCGCACGGATCACGCCCGCCTGGTTCTGGCTTTGCACGTTTGA 
GGTCTTCTACTTCACCGGCATCCGGTTGAATGCGCTGTTGTGCATCCGCAAGCGCGACATCGACTGGGAAAATCAACTGA 
TCCTCATCCGCGGCGAGACAGAGAAGACTCACAAAGAGTTCGTAGTGCCAATAACGGAGGGGCTTGTGCCTCACCTATCG 
AGGCTCCTGCAAGAGGCCGATAGAGCCGGATTCGCCGATGACGACCAGTTGTTCAACGTCAACCGGTTCTCACCGCACTA 
CAAGAGCAAGGTGATGAACTCCGACCAGGTCGAAGCCATGTACCGGAAGTTGACCGAGAAGGTTGGGGTGCGGATGACCC 
CGCACCGTTTCCGGCACACCCTGGCCACCGACTTGATGAAGGCACCCGAGCGGAACATTCACCTCACGAAGTGCCTGCTC 
AACCACTCGAATATCCAGACCACGATGAGCTACATCGAGGCCGACTACGATCACATGCGTGCCGTGCTGCATGCTAGAAG 
CCTGGCCCAAGGCGCGCTGGAGAATGTCAGGAAGGTGGATTACAGCGGCTCCCCGCAAGCCTCTGCCAAACCGAAGCCAT 
GCGGGCAACCTCTCGCTCGAGTGAGTGAAGCGCCGCCACCGGAGGCCAGGACAGAGCCTGCAGAACCAAGGGAGCACACG 
CCAGGGACAGGCATTCAGGGAGGTCCAACCGCGTGGGMGCAGATGCGCTACCACAGCCACCTGACACCTTCGAACCAAG 
CGTGCTGTTCACTCTGATGGCTCAAAACTTATCGAACCGTGCCGCCTCGGCATCCGCGGCTCCCGCTGCAACAAGCGGAT 
CAGGCGGATGGGGATCTGCCGCCCGAAGCAATCTCGCCTAGCGATACCGGTACTGAGGGCCGGCTACCGGACGAAAGGTA 
GCCGTGCCTTCCAGCAGATCGTTAGGCCTGTAGGAAAAATCTGGAATTACCGAGAGCGCCTGGATTCCAGCGCCGGCATG 
CTGGCAGAGCCAGCGCAATTTCAAGGCCAATACCACAGTACCCTCTGTAATCGCTGATTACGTCGGGGGCGCATTGCTAC 
GCCTGCAGAATGGTTTCAGGGCCTTAGAAACAGAAAAGCCCACCTAGAAAGGCGGGCTATTCCATATTGACATCACGTCA 
ATGCGGGCCTAATGTTCGGCCCAGACGGCTGCTAGACAAGAACCGGCGTAACACCCCTTCCTAGCCTATGCAACTCGCCC 
CGTAGAAAATGGTGGGTCGTGTAGGATTCGAACCTACGACCAATTGGTTAAAAGCCAACTGCTCTACCGACTGAGCTAAC 
GACCCAAGTATGAGGTGGTCGGGGTAGAGAGATTCGAACTCCCGACATCCTGCTCCCAAAGCAGGCGCGCTACCGGACTG 
CGCTATACCCCGATTGGAATTTGGCTCCGCGACCTGGACTCGAACCAGGGACCCAATGATTAACAGTCATTTGCTCTACC 
GACTGAGCTATCGCGGAACGTCTTTCTTCCAACCCTGGACGCTTCCGGTGTTGCTGGATTCGCGTCTCAGAGGCGCGCCA 
TTTTACGGATGCGCGCGGGCATGTCAACCCTCTGATCCAAAAAGTTTTTCTTCTTTTTCCACGAGCGACAAAACGGCCCT 
TCCACTGCATGCGGCAGCGCTCTCGCGCCTACCGGACGCCCATGAAAAAGCCCCGCCGAAGCGGGGCTTTCCCTGTCCGC 
CCCCGAAGAGGTCAGGCGAAGACGATCTCGTCGCCTTCCACCTTCGCCGAGATACTGGCACCCGGCGCGAATTTGCCGGC 
CAGGATCAGTTGCGCCAGCGGGTTCTCGATCCAGCGCTGGATGGCCCGCTTCAGCGGGCGTGCGCCATAGACCGGGTCGA 
AGCCGACGGCAATCAGCTTGTCCAGCGCCTCCTGGCTCAGTTCCAGGCTCAGCTCGCGCTCGGCCAGGCGCTTGCGCAGG 
CGACCGAGCTGGATCTCGGCGATGCCGGCGATCTGCTCGCGAGCCAGCGGCTCGAACACCACCACTTCGTCGATCCGGTT 
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GAT6AATTCCGGAC6GAAGTGC6CATTGACCGCGTCCATCACTGCGGCACGTTGC6CCTCGCGGTCGCCGGCCA6CTCCT 
GGATCTGCGCCGAACCGAGGTTGGAGGTCATCACCACCACGGTGTTGCGGAAGTCCACCGTACGCCCGTGACTGTCGGTC 
AGGCGTCCGTCCTCGAGCACCTGGAGGAGAATGTTGAATACATCCGGATGGGCCTTCTCCACCTCGTCCAGCAGCACCAC 
CGAGTAGGGCTTGCGGCGGATCGCCTCGGTCAGGTAGCCGCCTTCCTCGAAGCCGACGTAGCCCGGAGGCGCGCCGATCA 
GGCGGGCCACCGAGTGTTTCTCCATGAACTCGGACATATCTATCCGCACCAGCGCCTCCTCGGTATCGAAGAGGAACTCG 
GCCAGCGCCTTGCACAACTCGGTCTTGCCCACCCCGGTCGGGCCGAGGAAGAGGAACGAGCCGCTCGGCCGGTTCGGATC 
GGCGAGGCCGGCGCGCGAACGGCGCACGGCGTTGGACACGGCGACTACCGCCTCGTCCTGGCCGATCACTCGCCGATGCA 
GCTCCTGCTCCATGCGCAGCAGCTTCTCGCGCTCGCCCTCGAGCATCTTCGACACCGGGATACCGGTCCACTTGGAAACC 
ACTTCGGCGATTTCCTCGTCGGTCACCTTGTTGCGCAGCAACTGGTTCTCGGTCTTGCCGTGCTGGTCGACCATCTGCAG 
GCTGCGTTCCAGGTCCGGGATGGTCTGGTACTGGATGCGCGCCATGCTCTCGAGGTCGCCCTTGCGCCGCGCCGCCTCCA 
TCTCCTGCTTGGCCTGCTCGATCTTCTGCTGGATCTGCGCCGAGCCCTGCACCTCGGCCTTCTCGGACTTCCAGATCTCC 
TCGAGGTCGGCGTATTCGCGCTCGAGCTTGACGATATCCTCCTCCAGCTTGGCCAGGCGCTTCCTGGTGGCTTCGTCGTC 
TTCCTTCTTCAGCGCCTCGCGCTCGATCTTCAGCTGGATCAGGCGACGGTCGAGACGATCCAGTTCCTCCGGCTTGGAGT 
CGATCTCCATGCGGATGCGGCTGGCGGCCTCGTCGATCAGGTCGATGGCCTTGTCCGGCAGTTGCCGATCGGTGATGTAG 
CGGTGCGACAGCTTGGCCGCGGCGATGATCGCGCCGTCGGTGATGCTCACCCCGTGGTGCACTTCATAGCGTTCCTTGAG 
GCCACGGAGGATGGCGATGGTGTCTTCCTCGCTCGGTTCGTCCACCAGCACCTTCTGGAAGCGGCGCTCCAGCGCGGCAT 
CCTTCTCGATGTACTGGCGATACTCGTCGAGGGTAGTAGCACCGACGCAGTGCAGCTCGCCGCGCGCCAGAGCCGGCTTG 
AGCATGTTGCCGGCGTCCATGGCACCTTCCGCCTTGCCGGCGCCGACCATGGTGTGCAGTTCGTCGATGAACAGGATGAC 
CCGGCCTTCCTGCTTGCCCAGTTCGTTGAGGACCGCCTTCAGGCGTTCCTCGAACTCGCCGCGGAACTTGGCACCGGCGA 
TCAGCGCCCCCATGTCCAGGGCCAGCAGGCGCTTGTCCTTGAGGCCGTCCGGCACTTCGCCGTTGATGATGCGCTGGGCC 
AGGCCCTCGACGATGGCGGTCTTGCCGACGCCGGGTTCGCCGATCAGCACCGGGTTGTTCTTGGTCCGCCGCTGCAGGAC 
CTGGATGGTCCGGCGGATCTCGTCGTCGCGACCGATCACCGGGTCGAGCTTGCCTTCCTCGGCGCGCTTGGTCATGTCGA 
CGGTGTACTTGTCCAGCGCCTGGCGCGACTCCTCGACGTTCGGGTCGTTCACCGCTTCGCCGCCACGCAGGTTGGCCACG 
GCATTCTCCAGCGCCTTGCGCGACACGCCCTGGCCGAGCAGCAGCTTGCCGAGCCTGGTGTTCTCGTCCATCGCGGCCAG 
CAATACCAGCTCGCTGGAGATGAACTGGTCGCCCTTCTGCTGGGCCAGGCGGTCAGCCTGGTTGAGCAGGCGTGCGAGAT 
CCTGGGACAGGTTCACGTCGCCGGTCGGGCTCTGGATCTTCGGCAGCGCGTCGAGTTCTTTGTTGAGGCCGCTGCGCAGG 
GCGGCGATATCGAAGCCGACCTGCATCAGCAGGGGCTTGATCGAACCGCCTTGCTGCTCGAGCAGGGCGGAAAGCAGGTG 
CACCGGCTCGATGGCCGGATGGTCATGGCCAACGGCCAGGGACTGGGCGTCGGAGAGCGCCAGTTGCAGCTTGCTGGTCA 
AACGGTCTATTCGCATGGGTCGTCCTTCCTTCTATAGAGCGGGCCGGAACGATGGGTGTCCCTGATGAAGAAAAGCCCGC 
CGAGATGACTCAGTAGATAAGGGCGATTTTCCGCGGTTCAAGCGACCGGACCGTGACATCGGTCAGTTGCCGCCGGATAA 
CCTGCGCGGGCCTAGTCCTGGAGCCAGACCAGGCTGGCAAAACGGCCGGTACGCGACGAGCGGCGGTAGGAATAGAAGCG 
CGCGGTATCGCTGAAGGTGCAGAAGCCGCCGCCATGCACGGCGGTGACGCCATGGGCGCCCAGGCGGATCCGCGCGAGTC 
GGTAGATGTCGGCCATGAAGCGGCCCGGATTGGCGCTAGGTACGAAAGCCGAGCGCGCCTCGGCGTGCGCAGCGACGAAT 
GCATCGCGGACCTCGCCGCCGACCTCGAAGGCCTGCGGGCCGATCGCCGGCCCCAGCCAGACCAGCAGTTCGTCGCCGGG 
CACGCCCAGGCTGTCCACCGTCGCCTCCAGCACGCCCGCCGCCAGCCCGCGCCAGCCGGCATGGGCCGCGGCCACCCGGG 
TGCCCGAGCGGTCGCAGAACAACGCCGGCAGGCAGTCGGCGGTCATGATCGTACAGGCGACGCCCGGCATCGCGCTCCAG 
CTGGCGTCGGCCCTGAGCACCGGTTCGGGTCGGCCTCCACCACGTCACTCCGTGCACCTATTCCAACCAGCTCGGCCGGC 
ATTCCAGACGCTCGGTCAGGCGTCGGCGGTTTTATTCCACGGCGCGCGGATCGTCGTAGACGTGGGCGCCAAGGTTCAGA 
CTGTCGAAGGGTGCCTGGCTGACCCCGCCACTGCGCGTGGTCACGCAGGCCCGCACACGGGCCGGCGCCGGCCAGTCGGG 
GGTCAGCCAGGCGTTCAACCGACGAACGCCTCGCGATCCTGGCGCAACAGGCTGAGCAGCCAGAGGAATTCTTCCGGCAG 
CGGCGATTCCCACTTCATGCGCACGCCGGTGGCCGGGTGATCCAGTTCGAGGAAGCGCGCGTGCAGCGCCTGCCGGGGGA 
ATTCGCGAAGAGTCTGGACCAGGGTCTGGCTGGCCACCGGGGGAATCCTGAAGCGCCCACCGTAGACCGGATCGCCGACC 
AGGGGATAGCCAATATGGCTCATGTGCACGCGGATCTGGTGGGTACGCCCGGTCTCCAGCTTGACCCGGGTATGGGTGTG 
CGCACGGAAGCGTTCCAGCACGCGGTAATGGCTGACCGCCACCTTGCCGGCGTCGACCACCGCCATCTTCTGCCGCTGCA 
CGCCATGCCGTCCGATCGGCGCATCGATGGTGCCGCCGGAGGTGATCACGCCGATCACGATCGCCTCGTAGATGCGGCTG 
ACCGACCGTGCCTGCAGTTGCGCCACCAGCTTGGTGTGGGCCTCCAGCGTCTTGGCCACTACCATCAGGCCGGTCGTGTC 
CTTGTCCAGGCGGTGGACGATCCCGGCGCGCGGCACATTGGCGATGTCCGGGACATGGTAGAGCAAGGCATTCAGCAGGG 
TGCCGTCCTGATGGCCGGCAGCCGGATGGACCACCAGGCCGGCGGGCTTGTCAATCACCAGGATGTGCTCGTCCTCGTAG 
ACGATTTCCAGCTCGATGTCCTGTGCGAGCCACTCGCCCTGGGCTTCCTGCTCGGCCTCCAGGACCAGTTGCGCGCCGCT 
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GTGGACGAT6TCGCGCGG6C6CAGCACGGCGCCGTCGACGGTCAGGCGACCGTCCTTGATCCAGCC6GCCAGACGGGAGC 
GGGAGTGTTCGGGAAAAAGCTGGGCGGCGATCTGGTCGAGACGCTGGCCACCCAGCTCGAACGGCACCTCGGCCGCGCGT 
TGAATCATATCGGACATGAGTAGGAGACGATGCTCAGCGCGGCTTTTGGAATCGGCTACGCGCTGTGGTTAAATACGGGG 
TCTTTGTCCCAGGGGGTGCCTGGGGCGCCAATCATAACAGACGGTTGAGGCCAAGCCGACCGTCCCAGGGACGCAAGCCG 
CCATGCAAGTGAAACACCTGCTGCTGATCGCCATCCTCGCCCTCACCGCAGCCTGCTCCTCGAACAAGGAGACTGTCGAC 
GAGAACCTGAGCGAGAGCCAGCTGTACCAGCAGGCGCAGGACGACCTCAACAACAAGAGCTACAACAGCGCCGTCACCAA 
GCTGAAAGCCCTCGAATCGCGCTATCCCTTCGGCCGCTACGCCGAGCAGGCCCAGCTCGAGCTGATCTACGCCAACTACA 
AGAACATGGAGCCCGAAGCCGCCCGCGCCGCCGCCGAACGCTTCATCCGCCTGCATCCGCAGCACCCCAACGTCGACTAC 
GCCTACTACCTCAAAGGCCTGTCCTCCTTCGACCAGGACCGCGGCCTGCTGGCGCGCTTCCTGCCGCTGGACATGACCAA 
GCGCGACCCGGGCGCCGCCCGCGACTCCTTCAACGAGTTCGCCCAGCTCACCAGCCGCTTCCCCAACAGCCGCTACGCCC 
CGGACGCCMGGCGCG&TGGTGTACCTGCGCAAOT^ 

CGCCA6GCCTATGTCGCCGCCGCCAACCGCGGTCGCTACGTGGTGGA6AACTTCCAGGAAACCCCGGCCGTCGGCGATGG 

CCTGGCGATCATGGTCGAAGCCTACCGTCGCCTGGGTCTCGACGACCTGGCCAGCACCAGCCTGGAAACCCTCAAGCTGA 

ACTATCCGGATAACGCCAGCCTCAAGGATGGCGAGTTCGTCGCCCGCGAAAGCGAGGCCGACACCCGCTCCTGGCTGGCC 

AAGGCCACCCTGGGCCTGATCGAAGGCGGCGAGCCGCCGCCGCACATGGAAACCCAGGCCGCCAAGGACGTGATCAAGCA 

GTACGAGGATGCCGAGCGGGAGATCCCCGCCGAACTGAAGCCGGAAAACCAGGATCACAGCGCCGACGACGAGAAGCCGG 

AGAGCGATGACGACGAGGACTCCGGTCGCTCCTGGTGGAGCTACATGACCTTCGGTCTCTTCGACTGATCGCACGAAACA 

CCGAAGGGAGGCGCAGGCCTCCCTTCnTTTGCCCGCCGCCATGCCTCTCCCAGCGCCAAACGCCGCACAGCCTGGACCT 

TCCCGTCTGGGATCGAGCCGGCGGCTTGGCTAAACTGCAGCTTTCTCCAGCCTCCGAGATCACCATGGGCCTTTTCCGCC 

TCCTGTTCTGGATCGCCCTGATCGCCATCGCGTTCTGGCTCTGGCGTCGCTTTACCCGTCCCACTCCGCGCCAGCAGCAA 

CGTCCGCAGGACGAGCCGAGCGCATCGCCGATGGTCCGCTGCGCCCATTGCGGCGTCCACGTGCCGCAGGCCAACGCCCT 

CGCCCACGMCAACGCTGGTAT1GCAGCCAGGCGCACCTGCGCCAGGACCAGGGCGACCGTGCGCGCTGAACGGCTACGG 

CTGAGCGAGGAGCAGGGGCAACGCATCCTCCGTCTGTACCACCTGTACCGCCTGACCATCGGCCTGGTACTGGTCCTGCT 

GATCTCCAGCGAACTGGAAGATCAGGTCCTCAAGCTCGTCCACCCTGAACTGTTCCATGTCGGCAGTTGGTGCTACCTGG 

TCTTCAACATCCTGGTCGCGCTGTTCCTGCCGCCGTCGCGGCAATTGCTGCCGATCTTCATCCTCGCGCTCACCGACGTG 

CTGATGCTTTGCGGCCTGTTCTACGCAGGTGGCGGCGTACCCAGCGGCATCGGCAGCCTGCTGGTGGTGGCGGTGGCCAT 

TGCCAACATCCTGCTGCGCGGGCGCATCGGCCTGGTCATCGCGGCGGCGGCCAGCCTCGGCCTGCTCTACCTGACCTTCT 

TCCTCAGCCTGAGCAGTCCGGACGCCACCAACCACTACGTCCAGGCCGGCGGCCTCGGCACCCTGTGCTTCGCCGCCGCG 

CTGGTGATCCAGGCTCTGGTGCGGCGCCAGGAGCAGACCGAAACGCTGGCCGAAGAACGCGCCGAGACGGTCGCCAACCT 

GGAGGAACTCAACGCATTGATCCTGCAGCGCATGCGCACCGGCATCCTCGTGGTCGATAGCCGTCAGGCCATCCTCCTCG 

CCAACCAGGCCGCCCTCGGCCTGCTCAGGCAGGACGACGTGCAGGGCGCCAGCCTCGGCCGCCACAGCCCGATGCTGATG 

CACTGCATGAAGCAATGGCGCCTGAATCCCAGCCTCCGTCCGCCGACGCTCAAGGTGGTGCCGGATGGCCCGACGGTGCA 

ACCCAGCTTTATCAGCCTCAACCGCGAAGACGACCAGCACGTGCTGATCTTCCTCGAAGACATTTCGCAGATCGCCCAGC 

AGGCGCAGCAGATGAAGCTGGCCGGTCTTGGCCGCCTGACCGCCGGCATCGCCCATGAGATCCGCAACCCGCTGGGCGCG 

ATCAGCCACGCCGCCGiACTGCTGCAGGAGTCAGAGGAACTGGATGCCCCGGACCGACGCCTGACGCAGATCATCCAGGA 

CCAGTCGAAGCGGATGAACCTGGTCATCGAGAACGTCCTGCAGCTCTCCCGTCGCCGCCAGGCCGAACCGCAGCAGCTCG 

ACCTGAAGGAGTGGCTTCAGCGGTTCGTCGACGAATACCCCGGCAGGCTGCGCAACGACAGCCAACTGCACCTGCAGCTC 

GGTGCCGGCGACATCCAGACCCGCATGGACCCACACCAGTTGAACCAGGTGCTGAGCAACCTGGTGCAGAACGGTCTTCG 

CTACAGCGCCCAGGCGCACGGGCGCGGCCAGGTCTGGCTGAGCCTCGCGCGCGACCCGGAGAGCGACCTGCCGGTGCTGG 

AAGTCATCGACGACGGTCCCGGCGTACCGGCGGACAAACTGAACAACCTGTTCGAACCCTTCTTTACTACAGAAAGCAAA 

GGCACCGGCCTGGGCCTCTATCTCTCCCGCGAACTCTGCGAGAGCAACCAGGCACGGATCGACTACCGCAATCGCGAGGA 

AGGCGGCGGCTGCTTCCGCATCACCTTCGCCCACCCGCGCAAACTCAGCTGACGGAAGCCGCACGCATGAGCCGACAAAA 

AGCCCTGATCGTCGACGATGAACCGGATATCCGCGAACTGCTGGAAATCACTCTCGGCCGCATGAAGCTGGACACCCGCA 

GCGCCCGCAACGTCAAGGAAGCCGCGAGTTGCTGGCCCGCGAGCCGTTCGACCTGTGCCTCACCGACATGCGCCTGCCGG 

ACGGCAGCGGCCTCGATCTGGTCCAGTACATCCAGCAGCGCCATCCACAGACCCCGGTGGCCATGATCACCGCGTACGGC 

AGCCTGGACACCGCGATCCAGGCGCTCAAGGCCGGTGCCTTCGACTTCCTCACCAAACCGGTCGACCTCGGTCGCTTGCG 

GGAGCTGGTGGCAACCGCCCTACGCTTGCGCAACCCGGAAGCCGAGGAAGCGCCGGTGGACAACCGCCTGCTCGGCGAGT 

CGCCGCCGATGCGCGCCCTGCGCAACCAGATCGGCAAGCTGGCGCGCAGCCAGGCGCCGGTCTACATCAGTGGCGAGTCC 

GGCAGCGGCAAGGAACTGGTGGCGCGCCTGATCCACGAGCAGGGGCCACGTATCGAGCGGCCGTTCGTGCCGGTGAACTG 

CGGCGCGATTCCCTCCGAGCTGATGGAAAGCGAGTTCTTCGGCCACAAGAAAGGCAGCTTCACTGGCGCTATCGAAGACA 
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AGCAGGGCCTGTTCCA6GCCGCCAGCGGTGGCACCCTGTTCCTCGACGAAGTCGCCGACCTGCCGATGGCCATGCAGGTC 

AAACTGCTCCGGGCGATCCAGGAAAAGGCCGTGCGCGCGGTCGGCGGCCAGCAGGAGGTCGCCGTCGCACGTGCGCATCC 

TCTGC(K:CACCCA(MGGACCTCGCCGCCGAAGTCGGCGCCGGGCGCTTCCGCCAGGACCTCTACTACCGCCTCAACGTC 

ATCGAGCTGCGCGTACACCGCTGCGCGAACGCCGCGAGGACATCCCGCTGCTCGCCGAACGCATCCTCAAGCGCCTGGCC 

GGCGACACCGGCCTGCCGGCCGCCAGGCTGACCGGCGACGCACAGGAGAAGCTGAAGAACTACCGCTTCCCGGGCAACGT 

CCGCGAGCTGGAAAACATGCTGGAGCGCGCCTATACCCTGTGCGAAGACGACCAGATCCAGCCTCACGACCTGCGCCTGG 

CCGATGCGCCGGGTGCCAGCCAGGAAGGCGCCGCGAGCCTGAGCGAAATCGACAACCTCGAGGACTACCTGGAAGACATC 

GAGCGCAAGCTGATCATGCAGGCACTCGAGGAGACCCGCTGGAACCGCACCGCCGCGGCCCAGCGCCTGGGCCTGACGTT 

CCGCTCGATGCGCTACCGCCTGAAAAAGCTGGGCATCGACTGAAAGTGAAAAGGCCTGTCCGAAGACAGGCCTTTTGGTT 

TTCGCTCCTCAGAGGCGACCAGCCGGGGCGTAGGGGGCCGGGTCGATGATCGGTTCCCGCCCGCTCATGAGATCCGCCAG 

CAGACGGCACGACGCCGGTGCCAGGACCAGCCCGTTGCGGTAGTGCCCGGTATTCAGCCAGAGCCCGTCGAAGCCAGGCA 

CCGGACCGATATAGGGGATGCCTTCGGGAGAGCCCGGGCGCAACCCTGCCCAGTGGGCCACCGGCTGCATGTCCGCCAGT 

TCCGGCAACAGTTCTGCCGCAGACGCCCTGAGGCTTTCCAGCGCCTCGTCGGTCGGCGTCTTGTCGAAGCCCGAATGTTC 

CAAGGTGCTGCCGATCAGGATGTGGCCGTCGCGCCGCGGAATCGCGTAGCGCCCCTTGGCCAGCACCATGCGCGGCAGGA 

AATCCGCCGCGCACTTGTAGAGGATCATCTGACCTTTCACCGGTACCACGGGCAGTTCCAGGCCAAGCGGCTTCAACAAC 

TCGCCGCTCCAGGCGCCTGCCGCCAGCAGCACCTTGTCGCCACGGATCTCGCCACGCGAGGTCGCCACGCCGACCACTCG 

ATCGCCGTCGCGCAACCAGCCGCGCACCTCCGTCTGTTCATGCAACTCGAGATTGGCGAATTGTTGCAGGGATGCCCGCA 

ATGAGCGCGCCAGGCGAGGATTGCGCACATTGGCCACGCCCGACATGTAGACCGCCCGCTGGAAGCCTGCGCCCAGCCCG 

GGCACCGCCGCGTAGGCCTCCTCGATCGGCACTTCCTTCAACGGCCGGGTGTGGTTGCGTGCCCACTGCAGTGCCTCGGT 

CTGGTCGTCCAGGTCCAGCCAGTACAG6CCAACGGTATGGACCTCGGGATCGAGCCC6GTCTCGTCGAGCAAACGCTGCC 

CCAGGGCCGGGTAGAAGTCCTGCGACCAGTGCGCCAGGGCGGTCACCGCCGGGCTGTAGCGCCACGGATAGAGCGGCGAG 

ACGATCCCGCCTCCCGCCCAGGATGCCTCACGCCCACTCTCGCCCCGCTCCACCAGGGTCACCCGCAGTCCGGCGAGCGC 

CAGCTCCCGGGCGGTCAACAGGCCGATGACGCCAGCGCCTACCACTACTACATCTCTACTCACCACAGGGCTCCTACCGA 

TTTGCCAGGAACAGAGAAATATCACTCAAAGGGATCAGATGCTGACGAATTGCCTGCTTCAACGAACTCAGTCGAATCTA 

GTCCCGGTGAAAAGCCCATCATACCCGCAGAGGTATTCATCCCATGAAATCGAGTGGTTTGAATTTGGTGGAACTATCGA 

TAGTCCTATCGATCCTTGC6ATAGGCGTGACAATTGCGCTGCCCACCCTCCCCGACA6AATGAAGCGGGACATTAGCCGT 

GATATTGGTGACAGCCTGACTAGTCATGTGATGGCTGCGCGGGCTAGCAGCATACAGAACGGCGTGATCATCGAGGTGTG 

CGGTAGCGGTGACGGCAGTACCTGCAGCGAGGAATGGCATCTCGGCTGGTTCAGCCGTAACGACAGGAGCCAACAGATAC 

TGGCCCGGCATGAAAATACGAGTCGCACCGATATTCATTGGCGGGGCTTCGACAAGCGACTGCGCTACCTGCCTAATGGC 

ACCAGCCCTAQGGTMCG6GCGTTTCTTCGAATGTMGGACGATCGCATCGAGTGGCAATTGGTGCTCAATCGGCAAGG 

CCGCCTCAGGGTGGCGGGAAAGAGCGAAAATAAAAAGCTCTCTTACCTGTGCTCCAGGCGGTGAGAGAACTGTTTCACAT 

ACCGTTTGCCAGTCATCCCACTCTCCGCTCCGGCTGTCTCTGCTACAGGGACAATGCGCTCTCCACTAGGCAAGATTATC 

TGGCCCTTTTCCTTGTGGAGTACTGCATGCGCTCTATTTGTCGCAGCGCCGGCTTTTCCCTGATCGAGTTGATGATGGTG 

TTGGTTCTGGTCGCCATATTCGCCAGCATT6CCGTACCCAGTTTCAACGCCTTGATCGAGCGCAACCGAATCCAGACTGC 

CAGCGAGGAACTCTACAGCCTGCTTCAGTACGCTCGCAGCGAAGCTGTAAACCGTCATGCCAATGTGAGCATCAGGGCGA 

CGCAGAACAATGACTGGGCAAAAGGCCTGGAMTCATCAGCGGCGCGACQCCGTGCAAMGCACCMGGTT^ 

GTCTCGCTATCCGCCAKAGTKGACTGCGGAGCTGACCTTCAACGCTACCGGCACACTTAGCAACCAIMCTGCAAACAT 

TGACATAAAGGTCTGCTTCGCCGGTGACAAAAGTACAGGACGTCTGCTTACCGTTCAGCCCAGTGGACGCGTGATCCTGT 

ACCCATCTTCAAAGCAACCGGACAGCTGTAACTGAGGAAAGCCCATGTCTCGAGAAACGGGTTTCAGCATGATCGAAGTA 

CTGGTTGCTCTGGTGCTGATCAGCATTGGCGTACTGGGCATGGTTGCCATGCAAGGGCGCACGATCCAGTACACGCAGGA 

GTCGGTACAACGCAATGCCGCAGCAATGCTTGCTAGCGACCTGATGGAAATAATGCGTGCGGACCCAGATGCCGTACTCA 

ATCTACGCGCCCAACTACGCGAAGACTCGGTCTACTACAAGGCCAAGGGCAGCGACTTTCCCGCAGCCCCAGCGCGCTGC 

GCGCCATTGCCAGCAGATGCTAAGGAACGTCTCGGCTGCTGGGCCCAACAGGCCTCGAAAGACTTGCCGGGAGCCTCCGC 

ACTCTTGAATAGCCAATTCTACATTTGTCGC&GCC 

TCCAGGTTGCCTGGCGAGCCATGGATGGAGCGTGTTTCAACGCCTCTGACTCCACCTTGTGCACCTACAGCGTCCGCTCC 
GAATTGTGAGAACAAGCATGCTCTTCAGCAAAATGCAGAAAGGCCTATCGATGGTAGAACTGCTCGTGGCACTCGCTATA 
AGCAGCTTCCTGATCCTGGGGATCAGCCAGATCTACATCGACMCAMCGCAACTATCTTTTCCAGCAAGGCCAGGCCGG 
CAACCAGGAAAATAGCCGCTTCGTTCTTATGCTGCTGCAGCAACAACTGGATAAGACAGCCTATCGTCGCCTTCACGACG 
ACAACATGGAGAATGCTTTCAAATCCGCGACATTCAATGGCTGTCGTGCATTTGTGGCTGGCGAGACTATCGCTGCGGCA 
ACTGCCCTCAAGGCGGGTGAGTACGGTGTCTGCTTGCGCTATCAACCCGCCTACAAAGGGGAGCATGATTGCCTCGGTAA 
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TGAMTTACCG6AGTTCCG6AAAAGCCCTTCACAAATACTCCCCCTGTCGTCGTTCGCCTGGTCTACCTACCGAGCGCCG 
6TACCCTGAGTTGCAGTCGTCCC6ATATCGCCCAGTC6AAATCGGGAGAATT6GTCAGTGGTCTCACAGACTTCC6CTTG 
GAAGCGGGGGTCGGGCCAGCAGATCGTAGCGAACGCAAAGTATCCAGCTTCGTCGCACTACAGGATGTCGCCGGTCGTCC 
TATCCGAGCATTGCGCTTCTCAATCCTGGCAGGCAGCGACAATACAAGCCTGCGCACAGGAGATGATAGCCAGGCACGCG 
ATCGCTGGATCGTCCTTTATCCCGAGAGCAAAAGCGCCATCGAGGCCGCAGACAAAGGCCAGATTTACCAMTAGCGCGT 
GGTAACCAAACCATCAGGAATCTCATGCCATGACCCTGCGCCATACCTCTCGACAGCAGGGATCCACGTTGTTGATCTCG 
CTGGTTATCTTGTTGATGATCACGCTCCTCGCCGTTTCCAACATGCGCGAGGTGTCACTGGAAAGCCGTATCACCGGCAA 
TCTCATCGAACAGAAGCGCCTGCGCAATGCGGGCGAAGCTGGGCTACGCGAAGGTGAACGACGCTTTTTCAATACCATCA 
AGCCCCCAGAGGTCGGCAGCGGATGCGCCGATAGCAATGTCAAACGGCCTTGCATACTGAACCTGAGTGCCCTCTCCGTA 
CCCCGAGATGACGTGCACAACAATCCGGTGGCAGCCCTGAACGGCAAGACAGATAACGCCAATTCACGTGTCTGGATGCC 
CTACCGAGGCAGCGATCTGAATAACCCTACGCAGATCGACAAAGACCGCGCAGTCACCTGGCAGACCATCACGGTGCCCG 
CTGGCGAACAGAACAACGAAGCGGAAAATCCCGAGTACGGCAACATGATGCGCGGGGTCGGCACGTTCTACTACGAAACC 
AACAGCCGCGCCCTCAACAAGGCGGGCGGAGAGACTGTTCTACAGGCCGTTCATGCACGCCTGTATACCAACTGACTGGA 
GCCAGCGCATGATCCACCAGATTACCCGCGCAGGAAAAAGCCTGCTGGCTGCAGGTTGCACCCTGAGCATCCTGTTCGCC 
TCTGACAGTTATGCCGCCACGGCCCTGAATGTCAGCCAGCAACCCCTGTTCCTAACCCAGGGCGTTGCTCCCAACCTGCT 
GTTCACTCTAGATGACTCAGGCAGTATGGCCTGGGCTTACGTGCCCGACGGTATTAGCGGGAATAGCGGCAGAGCGGGAC 
GTTCCAGCGATTACAACGCACTGTACTACAACCCCGATTATGCTTACCAAGTGCCCAAGAAATTGACACTGTCAGGCGAT 
CAGATCATCGTTTCCGACTATCCAGTGCCACGCTTCACAGCAGCCTGGCAGGATGGCTACGCCCAAGGCTCCACCACCAA 
CCTGAGCAATAACTATCGCCCTCAATGGGGAACCGGCTGGCTTGGTTGCATCGATAGCAGCTGCAATACCGGGAGAGCTT 
ATTACTATACTTATAAGGTAAGCGCTAGCTGCCCTGCACAGCCGGTGAGCAGCTCCAACTCCTGTTATACCTACAATGCT 
CTTCCTACCAGTCAGGAAAGCAACTTTGCGATATGGTACTCCTACTATCGCAACCGCATCCTGGCCACAAAGACCGCTGC 
CAACCTGGCCTTTTACAGCCTGCCGGAAAACGTGCGTCTCACTTGGGGGGCCCTGAACACCTGTAGCATCGGCGCCAACA 
GCAGAAGCTGCCAAAACAATGCCCTGCTCCAATTCAACMGQ^ 

CCGGCCAGCGGCGGTACTCCTCTGCATGCGGCTCTTGACCGAGCCGGACGCTTCTTGCAAACCAACGGCACAGCTTATAC 
CACCGAAGACGGAAAGACATATTCCTGCCGGGCCAGCTATCACATCATGATGACCGACGGTATCTGGAACGGTCGGAACG 
TCACCCCCGGCAATCTCGACAACCAGAACCAGACCTTTCCTGATAGCACCCTCTATAGGCCACAGCCCCCTTATGCCGAC 
AGCAATGCCAGCTCATTGGCTGACCTGGCTTTCAAATACTGGACCACAGACTTACGTCCCAGCATCGACAATGACCTGAA 
GCCTTTCATGGCCTACAAGAGTGGGGACGATTCCAAGGATTACTGGGACCCTCGCAACAACCCAGCCACTTGGCAACACA 
TGGTCAACTTTACCGTTGGCCTAGGTCTTTCCTATTCGCTCACATTGAACTCTGCACCAACTTGGACAGGCAGCACCTTT 
GGCAACTACGAGGAGTTGATGGCTGGAAGCAAGGCTTGGCCCAGCGTCGATAACGACGCCGCACCCGGTAACGTCTACGA 
CCTCTGGCATGCAGCTATCAACTCTCGTGGAGACTTCTTTAGCGCGGAATCACCGGACTCTCTGGTTCAGGCTTTCAATA 
AGATCCTGACACGGATTTCCGAGCGCAACACCTCCTCCTCCAAACCAGCAATGACTTCCGCGCTGCAGGATGACGGAACC 
GGCGACMGCTGATCCGCTACAGCTACCAGTC(^GCTTTGCC^GTGACAAGAACTGGGCGGGCGACCTTATACGTTACAA 
GGTGGAGTCGACTTCCACCGGTTCGACCAAAACCCAGGAATGGAGCGCCGGCGCACTGCTGGACAACCGAGCTCCCGCTA 
CCCGTAATATTTACATCGCCAGCAATAGCGGAACCAACCGCCTTAAGCCTTTCACATGGAGCAATATTGAGGGAAGTCAG 
TTAGCCACTTGKTGAACCGCAACCCGGACAAGGAC^TCAGGCCGACACCmGGAGQQGCG(MTCGACTTCATCCG 
TGGCCAGCAGAATATGGATGGATTCCGGCAACGACAGGCGGTGTTAGGGGACATCGTGCACTCGTCTCCAGCCGTGGTCG 
GACCGGCCCAATACCTCACTTATCTGGCCAACCCCATCGAACCCAGCGGCGACTACGGCACATTCAAGACAGAGGCAGAC 
CAGCGCAGCCCTAGAGTTTATGTTGGATCCAACGATGGCATGTTGCATGGTTTCAACATCAAAACCGGCGTGGAAGAGTT 
CGCTTTCATCCCTACAGCAGTATTCGAAAAGCTTAACAAGCTTACCGGCATCAGCTACCAGGGCGGTGCCCACCAATATT 
TCGTCGACGCTACACCGGTCGTCAGCGATGCCTTTTTCGATGGAGCTTGGCACACTGTTCTGATCGGAACGCTTGGTGCT 
GGAGGTCGCGGCCTGTTCGCACTCGATGTAACCAAGCCGGACGATGTCAAGCTGCTTTGGGAATACGATAGCAGTACCGA 
CTCGGACCTTGGTTACACCTTCTCCAAACCTACCGTAGCCAGACTGCACAGCGGACAATGGGCAGTAGTTACCGGCAACG 
GCTATGGAAGCGATAATGACAAGGCAGCTTTACTGCTGATTGATTTGAAAAAGGGAACGCTGATCAAGAAGCTGGAAGTC 
CAAAGCGAGCGCGGAATAGCCAATGGCCTATCGACGCCTCGCCTGGCTGATAACAACAGCGATGGCATTGCTGACTACGC 
CTATGCTGGCGATCTGCAGGGAAATATCTGGCGCTTCGATTTGATCGGCAATACCCGCAACGACGACCCAGACACAAATA 
CCTCTATCAATCCCTTCAAGCCCGGAGATGTAGATCCTTCTGCTTTCAGAGTATCGTTCAGCGGCGCCCCGCTTTTCCGT 
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GCTC(K:6CCGACAACAATACTCGTCAGCCCATCAC6GCTCC6CCTACCTT6GTAC6CCATCCTAGCCGTAA66GCTACAT 
CGTCATCGTAGGTACAGGAAAATACTTCGAGGACGATGACGCTCAGGCCGATACCAGCCGAGCCATGACGCTCTATGGTA 
TCTGGGATCGCCAGAC(MGGGCGAAAGCGCAAACAGTACCCCAACCATCGACCGCAACGCCCTCACAGCCCAAACCATG 
ACAACAGAGGCGAACTCCACATTCGGTAGCGTGAACAGGAATATTCGGCTTATTAGCCAAAACCCGGTGAAGTGGTACAA 
AGAC(MAGCAACC(MTACCGCGAACTCGGATGTGGCTAGCTATGGCTG(X;GACTGAATCTGGAGGTCAATAGCAGCAAGA 
AAGGCGAAATGATGATCGAAGATATGTTCGCTGCCGGCCAAGTGCTTCTATTGCAGACCTTGACACCGAACGACGACCCT 
TGTGAQGCGGCTCTACCAGCTGGACCTACGGCCT(^TCC^TATACTGGCGGACGTACCAGTTTCACCGTCTTCGATCT 
CAAACGTGCGGGTATAGTGGACTCTGGCTCGGATTACAACGGCTCGGTCGTATCCGCCTTCCAACAGGATGGACTAGGTG 
GCTTGGCCATTACCCAGAACGAACAGCGTCMTCCGAGGCTTGCACTGGTGATGAGTGCATCATCTTCAACCCCAGCGAC 
MGAGTMCGGACGACAAACCTGGCGGGTCGTCGAGGAGAAATGAACATGAACCCCnACGTCTTCTCGCC^CAGCTCTT 
GCAGCTCTAGCTCTGGCTTGCCCAACCTTTGCCTTGAGTGCCACGAATACGTTCGAGAATGTGGGCGTGGTCGAGGATGT 
TCATCCTGCCGCCGGTCTGGTAGTAGTCGATGGGCAGACATATCGCTTGCCCAACCGTGTCCAACAACAGGACTCGCCGG 
TCATATTCTTGGTACGTCAGGGACAGACAGTGTCTTTCTCCGGCAAACTCACCAGCGACCTGCCAGAAATCGAGTCGTTC 
TACATTATCAAGCAGGCCCCTCTCGTTCCCTTCGGATCGGAGCAGCAACAATGAAGTCGAACAGAGGCTTCACTCTCATC 
GAGTTGATGATCGTCGTAGTAATCATCGCTATTCTTGCTGGTATCGCCTACCCCAGCTACGACGAATACGTGAAGCGCGG 
GAATCGCACCGAAGGACAGGCATTACTQGCGAAGCA^ 

TCACTACCCAAGCCGACATCGGCAAGCTGCATATGCGCAACACATCGGGCACCACAGTGAAGTCCTCCACAGGCAAATAC 
AGCCTTACCGTCGATACGGTAGCCAACGACGGAGGTTATCGCCTTATCGCTAACCAGGCATTCAACGATCTTGATTGTGG 
CAACCTGACCTTGACCGCCAACGGCGAGAAAGGCCGGACTGGAAGCAAGAAGAGCGTTGCAGAATGCTGGCGCTAAAGCG 
CCGAGACAAGAAAAAGGCAAAGCCCGGCATAAGCCGGGCTTTTTCAGGTGCGCAAAAATTCCGATTACAAAGCCTTGACC 
CGCAGTTCCTTGGGCATCGAGAAGGTAATGTTCTCCTCCCGTCCCTCCAGTTCCTGCTCTTCCGACGCCCCCCACTCACG 
TAGCTGGGCGATCACTCCGCGCACCAGCACTTCCGGCGCGGAGGCGCCTGCGGTGATTCCGATGCGACGCACACCGTCGA 
ACCAGCCGCGTTGCATGTCCTCGGCGCCGTCGATCAGGTAGGCCGGCGTGCCCATGCGCTCGGCGAGTTCGCGCAGGCGG 
TTGGAGTTGGAACTGTTGGGGCTGCCCACCACCAGGACCATGTCGCACTGGTCGGCCAGTTCCTTCACGGCATCCTGGCG 
GTTCT(3GGTG(3CATAGCA(»TGTCGTTCTT(K:GCGGCCCCTGGATCTGCGGGAACTTGGCGCGCAGGGCATCGATGACCT 
TCGAGGTGTCGTCCATCGACAGGGTGGTCTGGGTCACGTAGTGCAGGGCTTCGGGCTTGCGCACCTCCAGCGCGGCGACG 
TCGGCCTCGTCCTCCACCAGGTAGATGGCACCGCCGTTGCTGGCATCGTACTGGCCCATGGTGCCTTCCACCTCGGGGTG 
GCCTTCATGCCCGATCAGCACGCATTCGTGGCCGTCGCGGCTGTAGCGCACCACTTCCATGTGCACCTTGGTCACCAGCG 
GGCAGGTCGCGTCGAAAACCTTCAGGCCGCGCCCCTCGGCTTCCTTGCGGACCGCCTGGGAAACGCCGTGGGCGCTGAAG 
ATGACGATGACGTTGTCCGGCACCTGATCGAGTTCCTCGACGAAGATGGCGCCGCGCTGGCGCAGGTTGTCCACGACGAA 
CTTGTTGTGCACCACCTCGTGACGCACGTAGATCGGCGGGCCGAAGACATCGAGGGCACGGTTGACGATCTCGATGGCGC 
GATCCACGCCGGCGCAGAAGCCGCGGGGATTGGCGAGTTTGATTTGCATGGCGGTCTCGTGGGCGACGCGGTGATTGGAC 
GAATGAACCTTGCTACCGCCCTCCCCGCTTGGGAAGGGCGCAGCGACCGACGGTTCAGGCCGGCTGGACGTCGAT 



Fig. 2K 



15/133 

>0RF2 (SEQ ID N0:2) 

TCGCC6ATCCAATGCCAAG6A6TACCT6G6CAATCA6A6CCTACTCAC6GCTGCCGGGGCCGGCATTGCCAAGCTCCTG6 
ACGCCGACGAGAACAACACCAGTACCGTCTTCAGCGGCAACGGCACCAGCTTCGGGACGACCGGAACCAACAGCAACTCG 
GCCCTCAACAGCATCCTCTCCGGCGGCGTCAGCGACATCCGGCAGTGGATGAACAAGTTGTACGGGGAGGCCTTCGCCGC 
CGTCTACGTGCAGCCAGGTGCGCGGGTCGCAGTGCATCTCGATCAGCAACTGGCGATCGACTATGA 

>0RF3 (SEQ ID N0:4) 

CGCCGATCCAATGCCAAGGAGTACCTGGGG^ATCAGAGCCTACTCACGGCTGCCGGGGCCGGCATTGCCAAGCTCCTGGA 
CGCCGACGAGAACAACACCAGTACCGTCTTCAGCGGCAACGGCACCAGCTTCGGGACGACCGGAACCAACAGCAACTCGG 
CCCTCAACAGCATCCTCTCCGGCGGCGTCAGCGACATCCGGCAGTGGATGAACAAGTTGTACGGGGAGGCCTTCGCCGCC 
GTCTACGTGCAGCCAGGTGCGCGGGTCGCAGTGCATCTCGATCAGCAACTGGCGATCGACTATGAACTCAAGGGCCGCAA 
GGTCGATTACAGCTCTGGAGCCGCTCATGCAACAGCAGACTTGGACTAA 

>ORF602c (SEQ ID N0:6) 

TCGGCCTGGTCATTCGCGGAGGCATCCTGCTGCGGATCGATTGGGCGCCGCAGCGTTTGCCTGGCATCGAGGAGCAGCCG 
GCCACGGCTGTTGCCTATCGAGCTGGTCGCACCTCGCTCCCAGACGTCGAGCATGTTGGCCTCGCCGTGGGGCAGCATCT 
CCTCCTTGCTGGTGGAGCACGCTGCCAGGGTCAGCGCGCAGGCCAGGCCGGCGCAGAGACGAAGAAGGGGGTTAGTCCAA 
GTCTGCTGTTGCATGAGCGGCTCCAGAGCTGTAATCGACCTTGCGGCCCTTGAGTTCATAGTCGATCGCCAGTTGCTGAT 
CGAGATGCACTGCGACCCGCGCACCTGGCTGCACGTAGACGGCGGCGAAGGCCTCCCCGTACAACTTGTTCATCCACTGC 
CGGATGTCGCTGACGCCGCCGGAGAGGATGCTGTTGAGGGCCGAGTTGCTGTTGGTTCCGGTCGTCCCGAAGCTGGTGCC 
GTTGCCGCTGAAGACGGTACTGGTGTTGTTCTCGTCGGCGTCCAGGAGCTTGGCAATGCCGGCCCCGGCAGCCGTGAGTA 
G 

>ORF214 (SEQ ID N0:8] 

ACAAGTTGTACGGGGAGGCCTTCGCCGCCGTCTACGTGCAGCCAGGTGCGCGGGTCGCAGTGCATCTCGATCAGCAACTG 
GCGATCGACTATGAACTCAAGGGCCGCAAGGTCGATTACAGCTCTGGAGCCGCTCATGCAACAGCAGACTTGGACTAACC 
CCCTTCTTCGTCTCTGCGCCGGCCTGGCCTGCGCGCTGACCCTGGCAGCGTGCTCCACCAGCAAGGAGGAGATGCTGCCC 
CACGGCGAGGCCAACATGCTCGACGTCTGGGAGCGAGGTGCGACCAGCTCGATAGGCAACAGCCGTGGCCGGCTGCTCCT 
CGATGCCAGGCAAACGCTGCGGCGCCCAATCGATCCGCAGCAGGATGCCTCCGCGAATGACCAGGCCGACTACACCCGCA 
CGGCCAGCAACGAGATCCACAGTCAGTTCAAACGACTGCCCAATCCCGACCTGGTGATGTATGTGTTCCCGCACCTGGCC 
GGCAGCGATCCCGCCCCGGTACCGGGCTACACCACCGTGTTCCCCTTCTACCAGCGAGTCCAGTACGCCATGCCGGGCGA 
ACGCACGGAGGACTATTGA 

>ORF1242c (SEQ ID NO:10) 

TCTCGTCCTGGGCGTAGAACTGGACAATCCAGGGTGAGGTTTCGTGCTCGTCGAAGGAGTTCTGCAGGGCTTCTTTCAAT 
GCGTCCC(3GGCGTTCT(X:ATC(^TTGGGATCGCGGCCCTCGGTGCCCMGGGCACCAGTTC(3WGAATGCGGCGCGCGA 
ACGCCCATCCTCCAGGAGCATCACTTGCTCGTCGGGCAGGTACTCAGCCCAGGGCAGCAGGTCTACGAACGATGGGTCGT 
GATCGTAGAGGCGTGACGCTTCGGCCTGCGTGGCGCCATTCTTGCTCCCGGTGTTGGGCAGAGGAATACCCATGGCGGCC 
AGTCGCGCCAGATAGCGCTCAGTCGCTTCTTCCGCGGCCGCTACGTCCAGCGCTCCTGAATCTTCGGGAGCGTCTGCCGG 
TACCGACTGAGGCTGTGTGCGACCGCGCAGAAGGGTTTGAAAAAAGCCCATCAATAGTCCTCCGTGCGTTCGCCCGGCAT 
GGCGTACTGGACTCGCTGGTAGAAGGGGAACACGGTGGTGTAGCCCGGTACCGGGGCGGGATCGCTGCCGGCCAGGTGCG 
GGAACACATACATCACCAGGTCGGGATTGGGCAGTCGTTTGAACTGACTGTGGATCTCGTTGCTGGCCGTGCGGGTGTAG 
TCGGCCTGGTCATTCGCGGAGGCATCCTGCTGCGGATCGATTGGGCGCCGCAGCGTTTGCCTGGCATCGAGGAGCAGCCG 
GCCACGGCTGTTGCCTATCGAGCTGGTCGCACCTCGCTCCCAGACGTCGAGCATGTTGGCCTCGCCGTGGGGCAGCATCT 
CCTCCTTGCTGGTGGAGCACGCTGCCAGGGTCAGCGCGCAGGCCAGGCCGGCGCAGAGACGAAGAAGGGGGTTAGTCCAA 
GTCTGCTGTTGCATGAGCGGCTCCAGAGCTGTAATCGACCTTGCGGCCCTTGA 
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>ORF594 (SEQ ID N0:12} 

CCA6GCC6ACTACACCCGCACG6CCAGCAACGAGATCCACAGTCAGTTCAAACGACTGCCCAATCCC6ACCTGGTGAT6T 
ATGTGTTCCCGCACCTGGCCGGCAGCGATCCCGCCCCGGTACCGGGCTACACCACCGTGTTCCCCTTCTACCAGCGAGTC 
CAGTACGCQTGCCGGGCGMCGCACGGAGGACTATTGATGGGCTTTTTTCAAACCCTTCTGCGCGGTCGCACACAGCCT 
CAGTCGGTACCGGCAGACGCTCCCGAAGATTCAGGAGCGCTGGACGTAGCGGCCGCGGAAGAAGCGACTGAGCGCTATCT 
GGCGCGACTGGCCGCCATGGGTATTCCTCTGCCCAACACCGGGAGCAAGAATGGCGCCACGCAGGCCGAAGCGTCACGCC 
TCTACGATCACGACCCATCGTTCGTAGACCTGCTGCCCTGGGCTGAGTACCTGCCCGACGAGCAAGTGATGCTCCTGGAG 
GATGGGCGTTCGCGCGCCGCATTCTTCGAACTGGTGCCCTTGGGCACCGAGGGCCGCGATCCCAATTGGATGCAGAACGC 
CCGGGACGCATTGAAAGAAGCCCTGCAGAACTCCTTCGACGAGCACGAAACCTCACCCTGGATTGTCCAGTTCTACGCCC 
AGGACGAGATCAGCTGGGACAATTTCCAGGAGCAGTTGAGGCAGTACGTCCATCCTCGAGCGCGAGGATCGGCCTTCAGC 
GAGATGTACCTGGCGCTCATGAAGCATCACCTGGAGGGCATTTCGAAGCCGGGCGGACTGTTCGTCGACACCGCCGTCAG 
CAAGCTGCCCTGGCGAGGACAACAGCGCCGCGTGCGGATGGTCGTCTACCGCCGGATCCGCAAGGAGGATGCGCAGATTC 
GCGGACAGGACCCGGCGGCGTACCTGAAATCCATCTGCGAGCGTATCCAAGGCGGCCTGGCGAACGCCGGCATCGTCGCT 
TCGCGCATGGGCGGACAGGAGATCAGGAACTGGTTGATCCGCTGGTTCAACCCGCACCCGGATCACCTCGGCCAGGCCGA 
GGCGGACCTACGTCGCTTCTACGAACTGGTATGCCGTCCGGACGAACCGATCCTGCAGGATGAAnGCCACTGGCCGACG 
GCACTGACTTCTCCCAGAACCTGTTCTATCGGCAGCCTGTTTCCGATGCCACCCAGGGCGTATGGCTCTTCGATGCCATG 
CCGCACCGAGTGATTGTGGTCGACCAGTTGAACAAAGCGCCGCTGACAGGCCACTTCACCGGCGAGACGCTCAAAGGCGA 
TGGCCTCAACGCCCTGTTCGATCGAATGCCCGAGGACACGCTGCTGTGCATCACCATGGTCGTGACGCCGCAGGACATGC 
TGGAAGGGCATCTGCAGCAGCTCTCGAAAAAGGCCGTTGGTGACACCCAGGCCTCGATCCACACCCGCGAGGACGTGGCC 
ACCGTTCGACGCCTGATCGGCCGGGAGCACAAGCTCTATCGCGGAGCGATCGCTCTGTTCGTGCGCGGCCGCGACCATAC 
CCAGTTGGAGGAACGCTGCATCACCCTGAGCAACGTACTGCTCGGCGCCGGCCTGGTGCCGGTCGAACCGCAGAACGAAG 
TCGGACCGCTGAACAGCTACCTGCGCTGGCTCCCCTCAAACTTCGATCCAAACGAGAAGCGAGCCCTGGAGTGGTACACC 
CAGATGATGTTCGCTCAGCACATCGCCAACCTGTCGCCCATCTGGGGGCGCACCACCGGTACCGGACACCCTGGCTTCAC 
GCTGTTCAACCGTGGCGGCGCGCCGTTGACCTTCGACCCGTTCAACAAGCTGGACCGGCAGATGAATGCCCACGGCTTCA 
TCTTCGGGCCAACTGGCTCCGGCAAGTCGGCGTCCCTGACCAACCTCATCTGCCAGATGCTCGCCATGTACCTGCCGCGG 
ATGTTCGTCGCGGAAGCGGGCAACAGCTTCGGCCTGCTGGCCGACTTAGCCAAGCGGTTTGGCCTCTCGGTCCACCGGGT 
GCGCCTCGCCCCGGGCTCCGGCGTCAGCCTGGCGCCGTTCGCGGACGCCATCAAGCTGGTCGAGAGCCCCGACCAAGTGA 
AGGTGCTGGACGCCGAAGACATCGAGGCCTCGGACTCGGTCCAGGGCAGCAAGGCCGACCTCGAGGACGACCAGCGAGAC 
ATCCTGGGCGAGATGGAGATCGTCGCCCGCCTCATGATTACCGGTGGCGAAGAGAAGGAAGATGCGCGCCTGACCCGTGC 
CGATCGCAGCGCCGTCCGCCAGGCGATCCTGGCGGCGGCCAGGACCTGCGCCGCCGCGAACCGCACGGTACTGACCCAAG 
ACGTGCGCGATGCGCTCTACGAGGCCTCCAGGAGCGATAGCACCGCGCCAGAACGCCGCGCGCGGATCGCCGAAATGGCG 
GAAGCCATGCAGATGTTCTGCATGGGCGCCGACGGCGAGATGTTCAATCGCGAAGGCACGCCCTGGCCTGAGGCCGACCT 
TACCGTGGTGGATTTCGCAACGTACGCGCGCGAAGGCTACGCCGCCCAGCTCGGGATCGCCTACATCTCGCTGCTGAACA 
CCGTGAACMCATCGCCGMCGCGACCAGTTCAAGGGCCGGCCAATCGTCAAGATCACCGATGAGGGGCACATCATCACC 
MGQCCCGCTGCTGCTGCCCTACGCCATGAAGATCACCMGATGTGGCGGAAACTGGGCGCCTGGTTCTGGCTCGCCAC 
CCAGAACATCGACGACATCCCAGCCTCCGGGGCGCCGATGCTGAACATGATCGAGTGGTGGTTGTGCCTGAACATGCCCC 
CCGACGAAGTAGAGAAGATATCCAGGTTCCGCGAGCTGTCGCCGGCGCAGAAGTCGATGATGCTCTCGGCCCGCAAGGAA 
AGCGGCAAGTTCACCGAGGGCGTGCTCCTGGCCAAGGGCAAAGAATACCTCGTCCGTGTGGTTCCCCCGAGTCTCTACCT 
GGCCCTGGCCATGACCGAAAACGAAGAAAAGAACCAGCGCTACAACATCATGCMGCCACCGGCTGCGACGAGCTCGAGG 
CGGCCTTGCAGGTCGCAGCGGATCTCGACAAGGCGCGCGGCCTGCCACCCTTCCCCATTGTTTTCCCAGACCAACCGGCA 
GTGGAGTGCCAGGACGAATGA 

>ORF1040 (SEQ ID N0:14) 

GTACCTGCCCGACGAGCAAGTGATGCTCCTGGAGGATGGGCGTTCGCGCGCCGCATTCTTCGAACTGGTGCCCTTGGGCA 

CCGAGGGCCGCGATCCCAATTGGATGCAGAACGCCCGGGACGCATTGAAAGAAGCCCTGCAGAACTCCTTCGACGAGCAC 

GAAACCTCACCCTGGATTGTCCAGTTCTACGCCCAGGACGAGATCAGCTGGGACAATTTCCAGGAGCAGTTGAGGCAGTA 

CGTCCATCCTCGAGCGCGAGGATCGGCCTTCAGCGAGATGTACCTGGCGCTCATGAAGCATCACCTGGAGGGCATTTCGA 

AGCCGGGCGGACTGTTCGTCGACACCGCCGTCAGCAAGCTGCCCTGGCGAGGACAACAGCGCCGCGTGCGGATGGTCGTC 

TACCGCCGGATCCGCAAGGAGGATGCGCAGATTCGCGGACAGGACCCGGCGGCGTACCTGAAATCCATCTGCGAGCGTAT 

CCAAGGCGGCCTGGCGAACGCCGGCATCGTCGCTTCGCGCATGGGCGGACAGGAGATCAGGAACTGGTTGATCCGCTGGT 

TCAACCCGCACCCGGATCACCTCGGCCAGGCCGAGGCGGACCTACGTCGCTTCTACGAACTGGTATGCCGTCCGGACGAA 
CCGATCCTGCAGGATGA 
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>ORF1640c (SEQ ID N0:16) 

GTCC6CCTCGGCCTG6CC6A66T6ATCCG6GT6C6GGTTGAACCAGCGGATCAACCAGTTCCTGATCTCCTGTCCGCCCA 

TGCGCGAAGCGACGATGCCGGCGTTCGCCAGGCCGCCTTGGATACGCTCGCAGATGGATTTCAGGTACGCCGCCGGGTCC 

TGTCCGCGAATCTGCGCATCCTCCTTGCGGATCCGGCGGTAGACGACCATCCGCACGCGGCGCTGTTGTCCTCGCCAGGG 

CAGCTTGCTGACGGCGGTGTCGACGAACAGTCCGCCCGGCTTCGAAATGCCCTCCAGGTGATGCTTCATGAGCGCCAGGT 

ACATCTCGCTGAAGGCCGATCCTCGCGCTCGAGGATGGACGTACTGCCTCAACTGCTCCTGGAAATTGTCCCAGCTGATC 
TCGTCCTGGGCGTAGAACTGGACAATCCAGGGTGA 

>ORF2228c (SEQ ID N0:18) 

GGGGAGCCAGCGCAGGTAGCTGTTCAGCGGTCCGACTTCGTTCTGCGGTTCGACCGGCACCAGGCCGGCGCCGAGCAGTA 
CGTTGCTCAGGGTGATGCAGCGTTCCTCCAACTGGGTATGGTCGCGGCCGCGCACGAACAGAGCGATCGCTCCGCGATAG 
AGCTTGTGCTCCCGGCCGATCAGGCGTCGAACGGTGGCCACGTCCTCGCGGGTGTGGATCGAGGCCTGGGTGTCACCAAC 
GGCCTTTTTCGAGAGCTGCTGCAGATGCCCTTCCAGCATGTCCTGCGGCGTCACGACCATGGTGATGCACAGCAGCGTGT 
CCTCGGGCATTCGATCGAACAGGGCGTTGAGGCCATCGCCTTTGAGCGTCTCGCCGGTGAAGTGGCCTGTCAGCGGCGCT 
TTGTTCAACTGGTCGACCACAATCACTCGGTGCGGCATGGCATCGAAGAGCCATACGCCCTGGGTGGCATCGGAAACAGG 
CTGCCGATAGAACAGGTTCTGGGAGAAGTCAGTGCCGTCGGCCAGTGGCAATTCATCCTGCAGGATCGGTTCGTCCGGAC 
GGCATACCAGTTCGTAGAAGCGACGTAG 

>ORF2068c (SEQ ID NO:20) 

AGCTTGTGCTCCCGGCCGATCAGGCGTCGAACGGTGGCCACGTCCTCGCGGGTGTGGATCGAGGCCTGGGTGTCACCAAC 

GGCCTTTTTCGAGAGCTGCTGCAGATGCCCTTCCAGCATGTCCTGCGGCGTCACGACCATGGTGATGCACAGCAGCGTGT 

CCTCGGGCATTCGATCGAACAGGGCGTTGAGGCCATCGCCTTTGAGCGTCTCGCCGGTGAAGTGGCCTGTCAGCGGCGCT 

TTGTTCAACTGGTCGACCACAATCACTCGGTGCGGCATGGCATCGAAGAGCCATACGCCCTGGGTGGCATCGGAAACAGG 
CTGCCGATAG 

>ORF1997 (SEQ ID NO:22) 

CACCCAGGCCTCGATCCACACCCGCGAGGACGTGGCCACCGTTCGACGCCTGATCGGCCGGGAGCACAAGCTCTATCGCG 
GAGCGATCGCTCTGTTCGTGCGCGGCCGCGACCATACCCAGTTGGAGGAACGCTGCATCACCCTGAGCAACGTACTGCTC 
GGCGCCGGCCTGGTGCCGGTCGAACCGCAGAACGAAGTCGGACCGCTGAACAGCTACCTGCGCTGGCTCCCCTCAAACTT 
CGATCCAAACGAGAAGCGAGCCCTGGAGTGGTACACCCAGATGATGTTCGCTCAGCACATCGCCAACCTGTCGCCCATCT 
GGGGGCGCACCACCGGTACCGGACACCCTGGCTTCACGCTGTTCAACCGTGGCGGCGCGCCGTTGACCTTCGACCCGTTC 
AACAAGCTGGACCGGCAGATGAATGCCCACGGCTTCATCTTCGGGCCAACTGGCTCCGGCAAGTCGGCGTCCCTGACCAA 
CCTCATCTGC(^GATGCTCGCCATGTACCTGCCGCGGATGTTCGTCGCGGAAGCGGG(^CAGCTTCGGCCTGCTGGCCG 
ACTTAGCCAAGCGGTTTGGCCTCTCGGTCCACCGGGTGCGCCTCGCCCCGGGCTCCGGCGTCAGCCTGGCGCCGTTCGCG 
GACGCCATCAAGCTGGTCGAGAGCCCCGACCAAGTGAAGGTGCTGGACGCCGAAGACATCGAGGCCTCGGACTCGGTCCA 
GGGCAGCAAGGCCGACCTCGAGGACGACCAGCGAGACATCCTGGGCGAGATGGAGATCGTCGCCCGCCTCATGATTACCG 
GTGGCGAAGAGAAGGAAGATGCGCGCCTGACCCGTGCCGATCGCAGCGCCGTCCGCCAGGCGATCCTGGCGGCGGCCAGG 
ACCTGCGCCGCCGCGAACCGCACGGTACTGACCCAAGACGTGCGCGATGCGCTCTACGAGGCCTCCAGGAGCGATAG 

>ORF2558c (SEQ ID NO:24) 

GTCGGCCAGCAGGCCGAAGCTGTTGCCCGCTTCCGCGACGAACATCCGCGGCAGGTACATGGCGAGCATCTGGCAGATGA 

GGTTGGTCAGGGACGCCGACTTGCCGGAGCCAGTTGGCCCGAAGATGAAGCCGTGGGCATTCATCTGCCGGTCCAGCTTG 

TTGAACGGGTCGAAGGTCAACGGCGCGCCGCCACGGTTGAACAGCGTGAAGCCAGGGTGTCCGGTACCGGTGGTGCGCCC 

CCAGATGGGCGACAGGTTGGCGATGTGCTGAGCGAACATCATCTGGGTGTACCACTCCAGGGCTCGCTTCTCGTTTGGAT 
CGAAGTTTGA 
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>ORF2929c (5EQ ID NO:26) 

A6C6CATCGC6CACGTCTTG6GTCA6TACCGTGC66TTCGCGGCG6CGCA6GTCCTG6CCGCC6CCAGGATCGCCT6GCG 

GACGGCGCTGCGATCGGCACGGGTCAGGCGCGCATCTTCCTTCTCTTCGCCACCGGTAATCATGAGGCGGGCGACGATCT 

CCATCTCGCCCAGGATGTCTCGCTGGTCGTCCTCGAGGTCGGCCTTGCTGCCCTGGACCGAGTCCGAGGCCTCGATGTCT 

TCGGCGTCCAGCACCTTCACTTGGTCGGGGCTCTCGACCAGCTTGATGGCGTCCGCGAACGGCGCCAGGCTGACGCCGGA 

GCCCGGGGCGAGGCGCACCCGGTGGACCGAGAGGCCAAACCGCTTGGCTAAGTCGGCCAGCAGGCCGAAGCTGTTGCCCG 

CTTCC<mCX»A<»TCCG^^ 

<XMTTC«CaiAM»TG^ 

GCCACGGTTGAACAGCGTGAAGCCAGGGTGTCCGGTACCGGTGGTGCGCCCCCAGATGGGCGACAGGTTGGCGATGTGCT 
GA 

>ORF3965c (SEQ ID NO:28) 

GCGCCTGTTGGGCCGTATCAGGCTGTGGATGTTGTTGCAGCCATTCATCCAAGAGCTGCTTTATCTGCGGGACGATATCC 
CGGCGATCGACTGCCCTCAGTTGAATCTGCTGCAGCTCCTCTATCAGTACAGGAGCGCATATCCTTAGCGTCTGCAGGGC 
ATCCTCTTCGGGGTTCTGCAGGATCTGGGTCAGGTTGTCGATCAGGTTCTGGGTCAGCGAATTCAGAACTCTCATTCGTC 
CTGGCACTCCACTGCCGGTTGGTCTGGGAAAACAATGGGGAAGGGTGGCAGGCCGCGCGCCTTGTCGAGATCCGCTGCGA 
CCTGCAAGGCCGCCTCGAGCTCGTCGCAGCCGGTGGCTTGCATGATGTTGTAGCGCTGGTTCTTTTCnCGTTTTCGGTC 
ATGGCCAGGGCCAGGTAGAGACTCGGGGGAACCACACGGACGAGGTATTCTTTGCCCTTGGCCAGGAGCACGCCCTCGGT 
GAACTTGCCGCTTTCCTTGCGGGCCGAGAGCATCATCGACTTCTGCGCCGGCGACAGCTCGCGGAACCTGGATATCTTCT 
CTACTTCGTCGGGGGGCATGTTCAGGCACAACCACCACTCGATCATGTTCAGCATCGGCGCCCCGGAGGCTGGGATGTCG 
TCGATGTTCTGGGTGGCGAGCCAGAACCAGGCGCCCAGTTTCCGCCACATCTTGGTGATCTTCATGGCGTAGGGCAGCAG 
CAGCGGGTGCTTGGTGATGATGTGCCCCTCATCGGTGATCTTGACGATTGGCCGGCCCTTGAACTGGTCGCGTTCGGCGA 
TGTTGTTCACGGTGTTCAGCAGCGAGATGTAGGCGATCCCGAGCTGGGCGGCGTAGCCTTCGCGCGCGTACGTTGCGAAA 
TCCACCACGGTAAGGTCGGCCTCAGGCCAGGGCGTGCCTTCGCGATTGAACATCTCGCCGTCGGCGCCCATGCAGAACAT 
CTGCATGGCTTCCGCCATTTCGGCGATCCGCGCGCGGCGTTCTGGCGCGGTGCTATCGCTCCTGGAGGCCTCGTAGAGCG 
CATCGCGCACGTCTTGGGTCAGTACCGTGCGGTTCGCGGCGGCGCAGGTCCTGGCCGCCGCCAGGATCGCCTGGCGGACG 
GCGCTGCGATCGGCACGGGTCAGGCGCGCATCTTCCTTCTCTTCGCCACCGGTAATCATGAGGCGGGCGACGATCTCCAT 
CTCGCCCAGGATGTCTCGCTGGTCGTCCTCGAGGTCGGCCTTGCTGCCCTGGACCGAGTCCGAGGCCTCGATGTCTTCGG 
CGTCCAGCACCTTCACTTGGTCGGGGCTCTCGACCAGCTTGATGGCGTCCGCGAACGGCGCCAGGCTGACGCCGGAGCCC 
GGGGCGAGGCGCACCCGGTGGACCGAGAGGCCAAACCGCTTGGCTAA 

>ORF3218 (SEQ ID NO:30) 
GGGGCACATCATCACCAAGCACCCGCTGCTGCTC^ 

GGTTCTGGCTCGCCACCCAGAACATCGACGACATCCCAGCCTCCGGGGCGCCGATGCTGAACATGATCGAGTGGTGGTTG 
TGCCTGAACATGCCCCCCGACGAAGTAGAGAAGATATCCAGGTTCCGCGAGCTGTCGCCGGCGCAGAAGTCGATGATGCT 
CTCGGCCCGCAAGGAAAGCGGCAAGTTCACCGAGGGCGTGCTCCTGGCCAAGGGCAAAGAATACCTCGTCCGTGTGGTTC 
CCCCGAGTCTCTACCTGGCCCTGGCCATGACCGAAAACGAAGAAAAGAACCAGCGCTACAACATCATGCAAGCCACCGGC 
TGCGACGAGCTCGAGGCGGCCTTGCAGGTCGCAGCGGATCTCGACAAGGCGCGCGGCCTGCCACCCTTCCCCATTGTTTT 
CCCAGACCAACCGGCAGTGGAGTGCCAGGACGAATGAGAGTTCTGAATTCGCTGACCCAGAACCTGATCGACAACCTGAC 
CCAGATCCTGCAGAACCCCGAAGAGGATGCCCTGCAGACGCTAAGGATATGCGCTCCTGTACTGATAGAGGAGCTGCAGC 
AGATTCAACTGAGGGCAGTCGATCGCCGGGATATCGTCCCGCAGATAAAGCAGCTCTTGGATGAATGGCTGCAACAACAT 
CCACAGCCTGATACGGCCCAACAGGCGCTCATTGAGGCCGTGGACCGCGCGGAGATCCTACAGCGGAGGCAAGCGTGA 

>ORF3568 (SEQ ID NO:32) 

CCGAAAACGAAGAAAAGAACCAGCGCTACAACATCATGCAAGCCACCGGCTGCGACGAGCTCGAGGCGGCCTTGCAGGTC 
GCAGCGGATCTCGACAAGGCGCGCGGCCTGCCACCCTTCCCCATTGTTTTCCCAGACCAACCGGCAGTGGAGTGCCAGGA 
CGAATGAGAGTTCTGAATTCGCTGACCCAGAACCTGATCGACAACCTGACCCAGATCCTGCAGAACCCCGAAGAGGATGC 
CCTGCAGACGCTAAGGATATGCGCTCCTGTACTGATAGAGGAGCTGCAGCAGATTCAACTGAGGGCAGTCGATCGCCGGG 
ATATCGTCCCGCAGATAAAGCAGCTCTTGGATGA 
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>ORF4506c (SEQ ID NO:34) 

GTCAATAAGTTCGTTGTCTTTCGCACATTTCTCCA6TCGA6CCTGGTCCAGTTCAGGAAA6TCCAATGTGCCGCCA6GCA 

GCCCGCCCCCGTTGCCGGCCGACTGAGCGAAGATCGCATCGATAGCGCTCCAGAAGGCTTTGGCGCCGCCTTGGATCCCC 

GCGCACTCCACCAGGCGAGCCTGGTGGCGGGCCGCCTCGCCATGCATCTGCAGGGGAAGATGGCGCCAAACCAGGTTCAC 

GTCCGGATGGCTGTCTACCCA(X:GCTTMGCCGCGGGGTGTA(^CCTTGCAGAA(M(MCACTCCAGGTCGGCGTATTCAT 

TGATCGTCCAGCGCGCTTTCGCATCGCCGTAGAGGCTGTGGTTGGCTGGCAGGCCCTTCACCAGAAGCTCTACCCCTACG 

GCGGATGCAGCCAGCAAGACCAGCAGCAGCCCCGCCCAGGGCAGGGCGGGACCTTGAAATCGTTTGGCTGCCCAGCCGCC 

CTTCAAGAGTCTCACGCTTGCCTCCGCTGTAGGATCTCCGCGCGGTCCACGGCCTCAATGAGCGCCTGTTGGGCCGTATC 

AGGCTGTGGATGTTGTTGCAGCCATTCATCCAAGAGCTGCTTTATCTGCGGGACGATATCCCGGCGATCGACTGCCCTCA 
GTTGA 

>ORF3973 (SEQ ID NO:36) 
GGCCGTGGACCGCGCGGAGATCCTACAGCGGAGGCM^ 

AGGTCCCGCCCTGCCCTGGGCGGGGCTGCTGCTGGTCTTGCTGGCTGCATCCGCCGTAGGGGTAGAGCTTCTGGTGAAGG 
GCCTGCCAGCCAACCACAGCCTCTACGGCGATGCGAAAGCGCGCTGGACGATCAATGAATACGCCGACCTGGAGTGCCCC 
TTCTGCAAGGTCTACACCCCGCGGCTTAAGCGCTGGGTAGACAGCCATCCGGACGTGAACCTGGTTTGGCGCCATCTTCC 
CCTGCAGATGCATGGCGAGGCGGCCCGCCACCAGGCTCGCCTGGTGGAGTGCGCGGGGATCCAAGGCGGCGCCAAAGCCT 
TCTGGAGCGCTATCGATGCGATCTTCGCTCAGTCGGCCGGCAACGGGGGCGGGCTGCCTGGCGGCACATTGGACTTTCCT 
GAACTGGACCAGGCTCGACTGGAGAAATGTGCGAAAGACAACGAACTTATTGACTCAGATATCAAGTTGGACATCGACAT 
TGCACGGTCGAAGGGCATTACAGCGACCCCGACCCTCGTCATCCGGGACAACCAGACGGGACGAAGCGTGAAGCTTGAAG 
GCATGGCCGACGAGACCACGTTGCTGTCGGCGATAGACTGGCTAGCCAAGGATCTCTAG 

>ORF4271 (SEQ ID NO:38) 

ACCTGGTTTGGCGCCATCTTCCCCTGCAGATGCATGGCGAGGCGGCCCGCCACCAGGCTCGCCTGGTGGAGTGCGCGGGG 

ATCCAAGGCGGCGCCAAAGCCTTCTGGAGCGCTATCGATGCGATCTTCGCTCAGTCGGCCGGCAACGGGGGCGGGCTGCC 

TGGCGGCACATTGGACTTTCCTGMCTGGACCAGGCTCGACTGGAGAMTGTGCGAAAGACAACGAACTTATTGACTCAG 

ATATCAAGTTGGACATCGACATTGCACGGTCGAAGGGCATTACAGCGACCCCGACCCTCGTCATCCGGGACAACCAGACG 
GGACGAAGCGTGA 

>ORF4698 (SEQ ID NO:40) 

GAAATCGGCGAGGATTCCAACATCCCTCTTTTGGTCCTCCAGGATGCCCTGCACTTCACCTGGCAGAACCTCGACCTCCT 

CCCCATCCACAATCTTTACCATTCTCTTGTGGCCGGAGCTGGTGAGGCTAAGCCTCAACTCCATTGCCGGCCGAGCATTG 

ATGTAAATGCTCTCGAGCAAGCGCTCCATGACTTCGACCACTCCTTMTAT(^GTTAGCCAGCTACATACAGGAATTATG 

CTACCCAGGACATGCAGGCGTCACCCCTACTTATGTACGTGGCAGCGTTCGATCACGGCTCGAAAAAATACACCACCTAC 
GAGTTGA 

>ORF5028 (SEQ ID NO:42) 

TTTCCTGCTGCCCTATCGGAAGTGATCCTGTCTGCTGTCTGTACCTTTCTAGAACCGGTACAGACCCATGCCTCTTCATC 
ACTCCCCCCCTGGCCGGCGGCCACCAACGCTGGCCGTTGGCGTACTACTGGTACTGCTGAGCAGCGCGAGTCAGGCCGAA 
ACCfGGGTCATCACCGACAAGGCTCATCCGGTCTCTGCCACCGGATCGTCGCGCGTTCTGTTTCTGGACGCCCAGGAACA 
CCTCGAGGAGCAACTGACTGCGGCCTTGCCCCAGGATCCACAGCATGCTCAAGCGGCGTTTAA 

>ORF5080 (SEQ ID NO:44) 

AACCGGTAMCCCATGCCTCTTCATCACTCCCCCCCTGGCCGGCGGCCACCAACGCTGGCCGTTGGCGTACTACTGGT 
ACTGCTGAGCAGCGCGAGTCAGGCCGAAACCTGGGTCATCACCGACAAGGCTCATCCGGTCTCTGCCACCGGATCGTCGC 
GCGTTCTGTTTCTGGACGCCCAGGAACACCTCGAGGAGCAACTGACTGCGGCCTTGCCCCAGGATCCACAGCATGCTCAA 
GCGGCGTTTAAGCGATTGCTACAAAGCCCCGATGGGCGCCGCCTGCAGGCAGAGCTGGTCAAGGCACAACAAGACGTCGC 
CGATGCGTGGAGTCTCGGTGTCGAGAAGATCCCTGCCGTAGTAGTCGATAGGCAGTACGTGGTCTACGGCGAACCGGATG 
TTTCGCGCGCTCTTGAGCTAATCGCCAAGGCCAGGAGGTCGCGCTGA 
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>ORF6479c (SEQID NO:46) 

TTC6TCTCCGT6TCCTTATT66AAGTC66TACT6CA6ATGAACATCTGCCCCTT6CGCTG6CAGCAGGAGTAGGGAC6CC 
AGAGCGCCCAGGCGTGCTCCCCGTCGACGGCTTGCGTCTTCGGCCCAGAGTTGGGAAACACCGCGCAGTTGAGGCTCAGG 
GATGGGGTCAGCTCCTGCCATTTCCCGGTCGAGGCATCGCCCTCTTTCAGCTCGCCCGCCGGCCAGTAGCCGTCCTTGGG 
GGCTGCGCGCATGGGGAGGTAGACGTGGAGCTGGCCGATTCGCGTGGTGATATCGCCGGCGCGCTGGGCGATGACGGCTG 
CCGTCTTGTAGTCGTCGGTCTGGTGCAGGAAGCCGCTGCGCGGATAGAGGTTCCCCCACATGTCGCCGGAGAAGATTCCA 
CCCACCTCGCGCAGCCCTGGGACCAACGCTTCGGGGTAQ^ 

TGTGCTGAGmGTACGGCACCAGCG(MACGGTGGCGCCAGGGa^CGTACCCAGAGGCGCTGGCGAACCGGCTGAACG 
TGGCGCCACCAGGATGGCCGATCACATCCGCTTCCTTGAAGCGGCCGATGCTGTTCTCGGCCTTGTAGTTTGTGGTCGCG 
TCATTGCCGGCCTGGGCGAGTGGATTCGGTGTACCCAGCGCCGATACCTCGGTCCAGGGGTTGCTCCCGGTATTCGCGTA 
GCTGGAGACGACTGCGTCAGGCACGTAGTGGCGGACCTTGACCGACGTCTTCACTTTGCAGCCATGCGGGCCGCAGAGCA 
GCCAGTAACAGATCCCGACGACCTTGTATTCGAGGCACTGAGGGGAAAGGGTGGAGGAGACGATGGCAGCGCTGTTGATC 
GCGGCCGAGGCCGTGAACGAGAGGCTGAAGGTGGCGGCCGCCGCTGCCAGGCGGCGGAGGTTGAGGCTGGTCATCAGCGC 
GACCTCCTGGCCTTGGCGATTAGCTCAAGAGCGCGCGAAACATCCGGTTCGCCGTAG 

>ORF5496 (SEQ ID NO:48) 

GCTAATCGCCAAGGCCAGGAGGTCGCGCTGATGACCAGCCTCAACCTCCGCCGCCTGGCAGCGGCGGCCGCCACCTTCAG 
CCTCTCGTTCACGGCCTCGGCCGCGATCAACAGCGCTGCCATCGTCTCCTCCACCCTTTCCCCTCAGTGCCTCGAATACA 
AGGTCGTCGGGATCTGTTACTGGCTGCTCTGCGGCCCGCATGGCTGCAAAGTGAAGACGTCGGTCAAGGTCCGCCACTAC 
GTGCCTGACGCAGTCGTCTCCAGCTACGCGAATACCGGGAGCAACCCCTGGACCGAGGTATCGGCGCTGGGTACACCGAA 
TCCACTCGCCCAGGCCGGCAATGACGCGACCACAAACTACAAGGCCGAGAACAGCATCGGCCGCTTCAAGGAAGCGGATG 
TGATCGGCCATCCTGGTGGCGCCACGTTCAGCCGGTTCGCCAGCGCCTCTGGGTACGTTTGCCCTGGCGCCACCGTCCCG 
CTGGTGCCGTACTTTCTCAGCACACTGGACGCCATTGGCTGGCGGCATGGAATTCCCGAGCAGGTGTACCCCGAAGCGTT 
GGTCCCAGGGCTGCGCGAGGTGGGTGGAATCTTCTCCGGCGACATGTGGGGGAACCTCTATCCGCGCAGCGGCTTCCTGC 
ACCAGACCGACGACTACAAGACGGCAGCCGTCATCGCCCAGCGCGCCGGCGATATCACCACGCGAATCGGCCAGCTCCAC 
GTCTACCTCCCCATGCGCGCAGCCCCCAAGGACGGCTACTGGCCGGCGGGCGAGCTGAAAGAGGGCGATGCCTCGACCGG 
GAAATGGCAGGAGCTGACCCCATCCCTGAGCCTCAACTGCGCGGTGTTTCCCAACTCTGGGCCGAAGACGCAAGCCGTCG 
ACGGGGAGCACGCCTGGGCGCTCTGGCGTCCCTACTCCTGCTGCCAGCGCAAGGGGCAGATGTTCATCTGCAGTACCGAC 
TTCCAATAA 
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>ORF5840 (SEQ ID NO:50) 

CGC6ACCACmCTACMG6CC6A6AACAGCATCGGCC6CTTCMGGAA6CGGAT6T6ATCGGCCATCCTGGTG6C6CCA 
CGTTCAGCCGGTTCGCCAGCGCCTCTGGGTACGTTTGCCCTGGCGCCACCGTCCCGCTGGTGCCGTACTTTCTCAGCACA 
CTGGACGCCATTGGCTGGCGGCATGGAATTCCCGAGCAGGTGTACCCCGAAGCGTTGGTCCCAGGGCTGCGCGAGGTGGG 
TGGAATCTTCTCCGGCGACATGTGGGGGAACCTCTATCCGCGCAGCGGCTTCCTGCACCAGACCGACGACTACAAGACGG 
CAGCCGTCATCGCCCAGCGCGCCGGCGATATCACCACGCGAATCGGCCAGCTCCACGTCTACCTCCCCATGCGCGCAGCC 
CCCAAGGACGGCTACTGGCCGGCGGGCGAGCTGAAAGAGGGCGATGCCTCGACCGGGAAATGGCAGGAGCTGACCCCATC 
CCTGAGCCTCAACTGCGCGGTGTTTCCCAACTCTGGGCCGAAGACGCAAGCCGTCGACGGGGAGCACGCCTGGGCGCTCT 
GGCGTCCCTACTCCTGCTGCCAGCGCAAGGGGCAGATGTTCATCTGCAGTACCGACTTCCAATAAGGACACGGAGACGAA 
TCATGCGAATGAACATCACCTCGGTCGCGCTAATGTGGCTGCTCGCAGCGCAACTTGCCCAGGCCGACGACCCGATCAAC 
GTGTCCAAGACCGGCACGGTGCTCAGCGACGAGGTCCTCTACAGCATTGGCGGCGGCAGTGCGGTGAGCATGGGCAGCGC 
CGGCCAGATGGACTCGATCGGCGTCGGCTTCGGCTGGAACAACGACATGATGTGCGGAAACATGAACCTGAGCACCACCC 
TGGAGAACCAGCTCAACGGTGCCACACAGGGTTTCCAGAACATCATGGGCTCAGTCATCCAGAACGCGACCGGCGCGGTC 
ATGTCGCTGCCGGCGTTGATCATCCAGCGCGCGAACCCTCAGCTCTACAACCTGATCACCAATGGCATCCTGCAGGCGCG 
GATCGACTACGACCGCTCGAAAGGGACTTGCAAAACGATCGCCGAAAAGATGGCTGACATCGCTGGCGAGCAGACCGGCT 
GGGGGAAAATCGCCGAAGGCCAAGCCCTGGGCGCCACACTGGCCTCTGACGGGAAAGACGCCGTATCCGCCCTCGAAGCA 
GTGGAGAAGAAAGGCGGCAACGATGGCGTAACCTGGGTTGGTGGAGACAAGGCCGGCGGCTCCGGCCAGAAGCCCATTCG 
CATCGTCAACGACGTGACCCGGGCGGGCTACAACCTGTTGACCAGCCGCTCGGTGAATGATTCGTCGAGCGTGCCTTCCG 
CCACTTGCAACAACGGCCTGGTCTGCAACACTTGGTCCTCCCCCCAGGAGGCCGCCGCATTCGCCACCCGGGTACTGGGG 
GAGCAACAGCAACAGACCTGCGAAGGCTGCCAGAAGACGGTGACGGCTGCTGGCGTCGGCCTCACCCCGCTGATCCAGGA 
GACCTACGACAAGAAGCTCCAGTCGCTGCAGGAGCTGCTGTCGAAGAGCAAACCACTGACTGCAGAGAACCTGGCTGCGG 
CCGGCACCGATGCTCTGCCAATTACCCGCGGCGTCATCGAGGCGCTGCGCGACGAGCGTGACCAGGACGTCCTGGCGCGC 
CGCCTGGCGTCCGATGTCTCCCTGATGGACGTGCTCAGCAAGGCACTGCTACTGCAGCGCCTGATGTTCGCCGGCGCCAA 
GGAGCCCAACGTCGCCGCCAACGGCCTGGCCACCCAAGCCGTCGATCAGCAGACCAGCCTCCTGCAGCAGGAGATCTCCA 
ATCTCAAGACCGAACTGGAACTCCGTCGCGAGTTGGCCAGCAACTCCCCCATGCGGGTCATCGAGCGCGGGCAACAACGC 
GCCTCAGGGTCCAGTGGCGTGTTCGAGTCGGCGCCCGATGCCGATCGCCTCGATCGCCTGCAGGCCCCCTCTGCCGCCGG 
CGGCAAGTCGGGAGGGAGACCGTGA 

>ORF5899 (SEQ ID NO:52) 

TCGGCCATCCTGGTGGCGCCACGTTCAGCCGGTTCGCCAGCGCCTCTGGGTACGTTTGCCCTGGCGCCACCGTCCCGCTG 
GTGCCGTACTTTCTCAGCACACTGGACGCCATTGGCTGGCGGCATGGAATTCCCGAGCAGGTGTACCCCGAAGCGTTGGT 
CCCAGGGCTGCGCGAGGTGGGTGGAATCTTCTCCGGCGACATGTGGGGGAACCTCTATCCGCGCAGCGGCTTCCTGCACC 
AGACCGACGACTACAAGACGGCAGCCGTCATCGCCCAGCGCGCCGGCGATATCACCACGCGAATCGGCCAGCTCCACGTC 
TACCTCCCCATGCGCGCAGCCCCCAAGGACGGCTACTGGCCGGCGGGCGAGCTGA 

>ORF6325 (SEQ ID NO:54) 

GCCTCAACTGCGCGGTGTTTCCCAACTCTGGGCCGAAGACGCAAGCCGTCGACGGGGAGCACGCCTGGGCGCTCTGGCGT 
CCCTACTCCTGCTGCCAGCGCAAGGGGCAGATGTTCATCTGCAGTACCGACTTCCAATAAGGACACGGAGACGAATCATG 
CGAATGAACATCACCTCGGTCGCGCTAATGTGGCTGCTCGCAGCGCAACTTGCCCAGGCCGACGACCCGATCAACGTGTC 
CAAGACCGGCACGGTGCTCAGCGACGAGGTCCTCTACAGCATTGGCGGCGGCAGTGCGGTGAGCATGGGCAGCGCCGGCC 
AGATGGACTCGATCGGCGTCGGCTTCGGCTGGAACAACGACATGATGTGCGGAAACATGAACCTGAGCACCACCCTGGAG 
AACCAGCTCAACGGTGCCACACAGGGTTTCCAGAACATCATGGGCTCAGTCATCCAGAACGCGACCGGCGCGGTCATGTC 
GCTGCCGGCGTTGATCATCCAGCGCGCGAACCCTCAGCTCTACAACCTGATCACCAATGGCATCCTGCAGGCGCGGATCG 
ACTACGACCGCTCGAAAGGGACTTGCAAAACGATCGCCGAAAAGATGGCTGA 
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>ORF7567c (SEQ ID NO:56) 

CA6T6CCTTGCT6A6CACGTCCATCA6G6A6ACATCGGACGCCAGGCGSC6CGCCAG6AC6TCCT6GTCAC6CTCGTCGC 
GCAGCGCCTCGATGACGCCGCGGGTAATTGGCAGAGCATCGGTGCCGGCCGCAGCCAGGTTCTCTGCAGTCAGTGGTTTG 
CTCTTCGACAGCAGCTCCTGCAGCGACTGGAGCTTCTTGTCGTAGGTCTCCTGGATCAGCGGGGTGAGGCCGACGCCAGC 
AGCCGTCACCGTCTTCTGGCAGCCTTCGCAGGTCTGTTGCTGTTGCTCCCCCAGTACCCGGGTGGCGAATGCGGCGGCCT 
CCTGGGGGGAGGACCAAGTGTTGCAGACCAGGCCGTTGTTGCAAGTGGCGGAAGGCACGCTCGACGAATCATTCACCGAG 
CGGCTGGTCAACAGGTTGTAGCCCGCCCGGGTCACGTCGTTGACGATGCGAATGGGCTTCTGGCCGGAGCCGCCGGCCTT 
GTCTCCACCAACCCAGGTTACGCCATCGTTGCCGCCTTTCTTCTCCACTGCTTCGAGGGCGGATACGGCGTCTTTCCCGT 
CAGAGGCCAGTGTGGCGCCCAGGGCTTGGCCTTCGGCGATTTTCCCCCAGCCGGTCTGCTCGCCAGCGATGTCAGCCATC 
TTTTCGGCGATCGTTTTGCAAGTCCCTTTCGAGCGGTCGTAGTCGATCCGCGCCTGCAGGATGCCATTGGTGATCAGGTT 
GTAGAGCTGAGGGTTCGCGCGCTGGATGATCAACGCCGGCAGCGACATGACCGCGCCGGTCGCGTTCTGGATGACTGA 

>ORF7180 (SEQ ID NO:58) 

TTCGTCGAGCGTGCCTTCCGCCACTTGCAACAACGGCCTGGTCTGCAACACTTGGTCCTCCCCCCAGGAGGCCGCCGCAT 
TCGCCACCCGGGTACTGGGGGAGCAACAGCAACAGACCTGCGAAGGCTGCCAGAAGACGGTGACGGCTGCTGGCGTCGGC 
CTCACCCCGCTGATCCAGGAGACCTACGACAAGAAGCTCCAGTCGCTGCAGGAGCTGCTGTCGAAGAGG^AACCACTGAC 
TGCAGAGAACCTGGCTGCGGCCGGCACCGATGCTCTGCCAATTACCCGCGGCGTCATCGAGGCGCTGCGCGACGAGCGTG 
A 

>ORF7501 (SEQ ID NO:60) 

CCAGGACGTCCTGGCGCGCCGCCTGGCGTCCGATGTCTCCCTGATGGACGTGCTCAGCAAGGCACTGCTACTGCAGCGCC 
TGATGTTCGCCGGCGCCAAGGAGCCCAACGTCGCCGCCAACGGCCTGGCCACCCAAGCCGTCGATCAGCAGACCAGCCTC 
CTGCAGCAGGAGATCTCCAATCTCAAGACCGAACTGGAACTCCGTCGCGAGTTGGCCAGCAACTCCCCCATGCGGGTCAT 
CGAGCGCGGGCAACAACGCGCCTCAGGGTCCAGTGGCGTGTTCGAGTCGGCGCCCGATGCCGATCGCCTCGATCGCCTGC 
AGGCCCCCTCTGCCGCCGGCGGCAAGTCGGGAGGGAGACCGTGATGGCAGATACGCTCACCACCCGAAAGCTTCTCGGTC 
AGCTACTGGTCGGAGTGCTGATCGTCATCGGACTGGCAGTGGTCGGTACGCTGCTCAGTCTCTTCGCCCTGAACCACTTC 
GGTGGCATCCAGGGCCTGGAGGCCTGGCGGCAAAGCAACTACTGGAGCTTGTTCGCCTGGCGGGCGCTGCTGTACTGCGC 
CCTGGCCATCGCCTGGTTCCGGCAGCGCAAGGAACTGAGCGCGCATGAGCGGCAGCGCATTCGGCGGATCGAGATCCTGG 
TGCTGTTGCTGGTCCTGCTCATCGAATTCAGCAAAGCCTACTTCCGCACGGGAGGCGCAGCATGA 

>ORF7584 (SEQ ID NO:62) 

TGTTCGCCGGCGCCAAGGAGCCCAACGTCGCCGCCAACGGCCTGGCCACCCAAGCCGTCGATCAGCAGACCAGCCTCCTG 
CAGCAGGAGATCTCCAATCTCAAGACCGAACTGGAACTCCGTCGCGAGTTGGCCAGCAACTCCCCCATGCGGGTCATCGA 
GCGCGGGCAACAACGCGCCTCAGGGTCCAGTGGCGTGTTCGAGTCGGCGCCCGATGCCGATCGCCTCGATCGCCTGCAGG 
CCCCCTCTGCCGCCGGCGGCAAGTCGGGAGGGAGACCGTGATGGCAGATACGCTCACCACCCGAAAGCTTCTCGGTCAGC 
TACTGGTCGGAGTGCTGATCGTCATCGGACTGGCAGTGGTCGGTACGCTGCTCAGTCTCTTCGCCCTGAACCACTTCGGT 
GGCATCCAGGGCCTGGAGGCCTGGCGGCAAAGCAACTACTGGAGCTTGTTCGCCTGGCGGGCGCTGCTGTACTGCGCCCT 
GGCCATCGCCTGGTTCCGGCAGCGCAAGGAACTGAGCGCGCATGA 

>ORF8208c (SEQ ID NO:64) 

AGGTCATGCTGCGCCTCCCGTGCGGAAGTAGGCTTTGCTGAATTCGATGAGCAGGACCAGCAACAGCACCAGGATCTCGA 
TCCGCCGAATGCGCTGCCGCTCATGCGCGCTCAGTTCCTTGCGCTGCCGGAACCAGGCGATGGCCAGGGCGCAGTACAGC 
AGCGCCCGCCAGGCGAACAAGCTCCAGTAGTTGCTTTGCCGCCAGGCCTCCAGGCCCTGGATGCCACCGAAGTGGTTCAG 
GGCGAAGAGACTGAGCAGCGTACCGACCACTGCCAGTCCGATGACGATCAGCACTCCGACCAGTAG 
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>orf8109 (seq id no:66) 

gcggcagcgcattcggcggatcga6atcctggt6ctgtt6ctg6tcct6ctcatcgaattca6caaa6cctacttccgca 
cgggaggcgcagcatgaccttcatgaccaatgactacctggagtattacctcaccctcctcggctggatcatcaacaacg 
g&tci^caw»tctc^ 

cgtggggaaggcgccgacgagggcaacaagggagtgctgtctctcgcccgcatcgagacgcatatctacgtcggctacat 
cgtggtcgccctggcggggatcccggtcgtcaacgtgagcttcgacaccatcgagttcgaccagactcgcgcccagcagt 
gccaatacaatctgccggcaccggcggacaccggctggtcgagctccttcagcagcctggccggcaagagtgcgcagatg 
ccgctctostggmtgatgcacgccctgtccm^^ 

tctgcggcagatgcgaatggaagtggacaacacgcgcgtgaacaatccgctgctggcacaagaaatcgctgatttttcca 
gagactgctacgggccttcccgtgcgcggctgttcatgcggcaacccgacctgggctccgtcgccgaggacaacaaggcg 

TTGCMGACCTGAACTGGATCGGCTCCCGATTCTTGTTGMCACCCCGGGGTACTACGACACCGACTACTCGAAGAGTCC 

CCGTCAGTCGTGGCCCTACAACGCCACCCGCGATGCCGGCCTGCCTCAGGTGGGCGGTGGTGGCGGCTACCCAACCTGCA 

AGCAGTGGTGGGCTGACTCAGGGATCGGCTTGCGTGATCGGATCAAGGACCAGGTGGATCCGGACCTGATGACCAGCTTC 

CTCAAGTGGGCGAAATGGTTGAACCAGGACGAGGTGACCGAGGCTGTCATTCGCCAGGTGATCTCACCCTCCAGCCAGGT 

CAAGGGTMCGTCTACACCGAnACGGCGGGCAGGTGGGCGGCACCGTGTGGMCGGCATCGCGAGAACCGCAGGAACCT 

TCGGCGTTGCGGTGGGCAGCTTGGCATACTTCCCGGCGATGGATATGGTCCGCCAGGCACTGCCGATGGTGATGTCGTTC 

CTGAAGATGGCAATGGTCATCTGCATTCCGATGGTCCTGGTCATCGGCACCTATCAACTGAAAGTTGCCATGACGATGAC 

GGTCGTCTTCTTTGCGATGATGTTCGTCGACTTCTGGTTTCAGTTAGCCAGATATATCGACAGCACGATACTTGATGCTT 

TCTATGGTTCGGGAT(^CCACATCTTTCATTCMCCCAGTCATGGGGCTGAATACGGCTACTCMGATGCGATCTTG 

TTCGTTATGGGTTCTATGTTCATTGTTTTACCACTACTGTGGATGACAGCGATCGGCTGGTCCGGAATTCAAGCAGGGTC 

TGTTCTGAACGGATTGAGCAGAGGGACTGAAGGAGTTCAAGCCGCCGGCAAGGAAGCAGGAAATAGAGTTAAAAACGCAG 
TTTGA 

>ORF9005c (SEQ ID NO:68) 

GTCAGCCCACCACTGCTTGCAGGTTGGGTAGCCGCCACCACCGCCCACCTGAGGCAGGCCGGCATCGCGGGTGGCGTTGT 
AGGGCCACGACTGACGGGGACTCTTCGAGTAGTCGGTGTCGTAGTACCCCGGGGTGTTCAACAAGAATCGGGAGCCGATC 
CAGTTCAGGTCTTGCAACGCCTTGTTGTCCTCGGCGACGGAGCCCAGGTCGGGTTGCCGCATGAACAGCCGCGCACGGGA 
AGGCCCGTAGCAGTCTCTGGAAAAATCAGCGATTTCTTGTGCCAGCAGCGGATTGTTCACGCGCGTGTTGTCCACTTCCA 
TTCGCATCTGCCGCAGATCCGTGCCGCACGGAATGGCCGCGATGGCGCCGCTGGTGAAGCCCTTGGACAGGGCGTGCATC 
ATCGCCCACCAGAGCGGCATCTGCGCACTCTTGCCGGCCAGGCTGCTGAAGGAGCTCGACCAGCCGGTGTCCGCCGGTGC 
CGGCAGATTGTATTGGCACTGCTGGGCGCGAGTCTGGTCGAACTCGATGGTGTCGAAGCTCACGTTGACGACCGGGATCC 
CCGCCAGGGCGACCACGATGTAGCCGACGTAGATATGCGTCTCGATGCGGGCGAGAGACAGCACTCCCTTGTTGCCCTCG 
TCGGCGCCTTCCCCACGAACTTTCAGCCATTCGCGCATCACGATGGCCGCGAACGGCACCGCGAACAGGCCAGTATCCGA 
GATCATGTTCCAGATCCCGTTGTTGATGATCCAGCCGAGGAGGGTGAGGTAATACTCCAGGTAGTCATTGGTCATGAAGG 
TCATGCTGCGCCTCCCGTGCGGAAGTAG 

>ORF8222 (SEQ ID NO:70) 

CTACCTGGAGTATTACCTCACCCTCCTCGGCTGGATCATCAACAACGGGATCTGGAACATGATCTCGGATACTGGCCTGT 
TCGCGGTGCCGTTCGCGGCCATCGTGATGCGCGAATGGCTGAAAGTTCGTGGGGAAGGCGCCGACGAGGGCAACAAGGGA 
GTGCTGTCTCTCGCCCGCATCGAGACGCATATCTACGTCGGCTACATCGTGGTCGCCCTGGCGGGGATCCCGGTCGTCAA 
CGTGAGCTTCGACACCATCGAGTTCGACCAGACTCGCGCCCAGCAGTGCCAATACAATCTGCCGGCACCGGCGGACACCG 
GCTGGTCGAGCTCCTTCAGCAGCCTGGCCGGCAAGAGTGCGCAGATGCCGCTCTGGTGGGCGATGATGCACGCCCTGTCC 

AAGGGCTTCACCAGCGGCGCCATCGCGGCCATTCCGTGCGGCACGGATCTGCGGCAGATGCGAATGGAAGTGGACAACAC 
GCGCGTGAACAATCCGCTGCTGGCACAAGAAATCGCTGA 

>ORF8755c (SEQ ID NO:72) 

CAGTCTCTGGAAAAATCAGCGATTTCTTGTGCCAGQGCGGATTGTTCACGCGCGTGTTGTCCACTTCCATTCGQTCTG 

CCGCAGATCCGTGCCGCACGGAATGGCCGCGATGGCGCCGCTGGTGAAGCCCTTGGACAGGGCGTGCATCATCGCCCACC 

AGAGCGGCATCTGCGCACTCTTGCCGGCCAGGCTGCTGAAGGAGCTCGACCAGCCGGTGTCCGCCGGTGCCGGCAGATTG 

TATTGGCACTGCTGGGCGCGAGTCTGGTCGAACTCGATGGTGTCGAAGCTCACGTTGACGACCGGGATCCCCGCCAGGGC 
GACCACGATGTAG 
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>ORF9431c (SEQ ID NO:74) 

CTGAAACCAGAA6TCGACGAACATCATCGCAAA6AA6AC6ACC6TCATCGTCATG6CAACTTTCAGTT6ATAGSTGCCGA 
TGACCAGGACCATCGGAATGCAGATGACCATTGCCATCTTCAGGAACGACATCACCATCGGCAGTGCCTGGCGGACCATA 
TCCATCGCCGGGAAGTATGCCAAGCTGCCCACCGCAACGCCGAAGGTTCCTGCGGTTCTCGCGATGCCGTTCCACACGGT 
GCCGCCCACCTGCCCGCCGTAATCGGTGTAGACGTTACCCTTGACCTGGCTGGAGGGTGA 

>ORF9158 (SEQ ID NO:76) 

CGTCTACACCGATTACGGCGGGCAGGTGGGCGGCACCGTGTGGAACGGCATCGCGAGAACCGCAGGAACCTTCGGCGTTG 
CGGTGGGCAGCTTGGCATACTTCCCGGCGATGGATATGGTCCGCCAGGCACTGCCGATGGTGATGTCGTTCCTGAAGATG 
GCAATGGTCATCTGCATTCCGATGGTCCTGGTCATCGGCACCTATCAACTGAAAGTTGCCATGACGATGACGGTCGTCTT 
CTTTGCGATGATGTTCGTCGACTTCTGGTTTCAGTTAGCCAGATATATCGACAGCACGATACTTGA 

>ORF10125c (SEQ ID NO:78) 

GTGATAGCAGGATGCCTCCCTTTGGGAGCCAGGAGATTGATGATGAACGCGCACACCAACAAAGGCTTTGCCTCCCGGAT 
CGGTTTTGGTCTGGGTATGCTTGTGCGTTTCTGCCTGCATGATCGCCGTCCAGCTCTACGTTGGGTTAAGCGAGTTAGCC 
TATTCTTGTTAGTAGCTCTTGTAGTGTCACAGAATTTTATGTGGCTTGCTGGGGTATCAATGACTCTACTGTGTGTCTTT 
CTGGTGGGATTTGCCTTGGTTAAAGGGGACATCTCCGTCTCTAAAGGGTCTCCMGTCGAGATGTCTCAACTATGACTTC 
ACAAGCTGAAACTGAATCTGTAGCAGAGCTGTTTGACTATCAGGCAGCACACCATTACCGGGACTAG 

>ORF9770 (SEQ ID NO:80) 

TCAAACAGCTCTGCTACAGATTCAGTTTCAGCTTGTGAAGTCATAGTTGAGACATCTCGACTTGGAGACCCTTTAGAGAC 
GGAGATGTCCCCTTTAACCAAGGCAAATCCCACCAGAAAGACACACAGTAGAGTCATTGATACCCCAGCAAGCCACATAA 
AATTCTGTGACACTACAAGAGCTACTAACAAGAATAGGCTAACTCGCTTAACCCAACGTAGAGCTGGACGGCGATCATGC 
AGGCAGAAACGCACAAGCATACCCAGACCAAAACCGATCCGGGAGGCAAAGCCTTTGTTGGTGTGCGCGTTCATCATCAA 
TCTCCTGGCTCCCAAAGGGAGGCATCCTGCTATCACCTATACGCCGAAAAAGATGATTTGGCAAGCATTATGGCATATTA 
TGCCACTAGCTATCTGCCGACTGGAGTACCTCATGGCAACGCGAAACGTCGTCCTTCCCGATCCGCTGGAGCAGGATATC 
AACGAGCTGGTGGAGACCGGCCGCTATCAGAATCGCAGCGAAGTCATCCGGGCAGGCTTGCGCCTGCTGCTGCAACAGGA 
AGCCCAGATANGCGCCAAGCTCGAAACCCTCCGCAACGCAACATCCAGTGGGCTGATGCAACTGGAGCGCGGCGAGTACG 
ACGAGATCACCAGCGACGAACTGGCCCAATACCTCGACGAGCTCGGCAACCAGGCGAGCCACTGA 

>ORF9991 (SEQ ID NO:82) 

AGCTGGACGGCGATCATGCAGGCAGAMCGK!ACAAGCATACCCAGACCAAAACCGATCCGGGAGGCAAAGCCTTTGTTGG 
TGTGCGCGTTCATCATCAATCTCCTGGCTCCCAAAGGGAGGCATCCTGCTATCACCTATACGCCGAAAAAGATGATTTGG 
CAAGCATTATGGCATATTATGCCACTAGCTATCTGCCGACTGGAGTACCTCATGGCAACGCGAAACGTCGTCCTTCCCGA 
TCCGCTGGAGCAGGATATCAACGAGCTGGTGGAGACCGGCCGCTATCAGAATCGCAGCGAAGTCATCCGGGCAGGCTTGC 
GCCTGCTGCTGCAACAGGAAGCCCAGATMGCGCCAAGCTCGAMCCCTCCGCAACGCAACATCCAGTGGGCTGATGCAA 
CTGGAGCGCGGCGAGTACGACGAGATCACCAGCGACGAACTGGCCCAATACCTCGACGAGCTCGGCAACCAGGCGAGCCA 
CTGAAGCATGGCCAAGTACCGCATCTCTCATGA 

>ORF10765c (SEQ ID NO:84) 

CACCTGGTCTGTCGCCACCCGGTAGAAGACGAAGTGCCTGGGCCGAACAACCTTACCGACATTGGGCATCGAGTGGCAGT 
AAACGAGGTGGATGCTGCGCAGGCCAGCTCCCAGTTCTTCACGGCTGATGCTGCCTACCTGTTGTGGGTCTGTCGCAACT 
GCTTCCAGCGCCGCCCCTATGAGTGCCTGGTAACGTCGGCGCGCGGCATCGCCGAAGTGGTTGTGGGTGAAGCGCAGGAT 
ATCGACGATGTCCGCTTGGGCATCATGAGAGATGCGGTACTTGGCCATGCTTCAGTGGCTCGCCTGGTTGCCGAGCTCGT 
CGAGGTATTGGGCCAGTTCGTCGCTGGTGATCTCGTCGTACTCGCCGCGCTCCAGTTGCATCAGCCCACTGGATGTTGCG 
TTGCGGAGGGTTTCGAGCTTGGCGCNTATCTGGGCTTCCTGTTGCAGCAGCAGGCGCAAGCCTGCCCGGATGACTTCGCT 

GCGATTCTGATAGCGGCCGGTCTCCACCAGCTCGTTGATATCCTGCTCCAGCGGATCGGGAAGGACGACGTTTCGCGTTG 
CCATGAGGTACTCCAGTCGGCAGATAGCTAG 
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>ORF10475(SEQID NO:86) 

AGCAT6GCCAAGTACCGCATCTCTCATGATGCCCAAGCGGACATCGTCGATATCCTGCGCTTCACCCACAACCACTTCGG 
CGATGCCGCGCGCCGACGTTACCAGGCACTCATAGGGGCGGCGCTGGAAGCAGTTGCGACAGACCCACAACAGGTAGGCA 
GCATCAGCCGTGAAGAACTGGGAGCTGGCCTGCGCAGCATCCACCTCGTTTACTGCCACTCGATGCCCAATGTCGGTAAG 
GTTGTTCGGCCCAGGCACTTCGTCTTCTACCGGGTGGCGACAGACCAGGTGCTAGAGGTGGTTCGCGTGCTTCACGACGC 
CATGGATGTGGATCAACACCTGCCCCAACGATGA 

>ORni095c (SEQ ID NO:88) 

AGCCGCATGCAAGCGGTGGTCAGCACGAATGCAAATGCTTGGTCAGGGGGAATGCAATCGAGTGGTCAAGCCACTGCTAT 
TGCGCATCAACCATGGGGCACCTGCTGGTGGATGTTCACCCGTAGCCTTTTCGTGTTCGCCGGCGCGAACGCAGCCCTTT 
CTGCCTTCCGGCAGGCCCTTTCGGGTAGGGCTTTTACCCTTGTGAACCATTCCCTTCGCCCTTCAAGCCCATTTCCCCTT 
TGGGCCATTTGCTCCTGTTACAGTTGCTCATCGTTGGGGCAGGTGTTGATCCACATCCATGGCGTCGTGAAGCACGCGAA 
CCACCTCTAG 

>ORF11264 (SEQ ID NO:90) 

ACCGCGGTGCGGAGAGATCTCCTCAAACTGATGGGTTGCACGCATATCGAAGCAGATTACATAGGAGGCTTGCGCTGTTC 
AACAGCTCCTGAGGGGACTTGGGTTGCCCATGGTTTCCACGGCCCAATCGTTGACGTCATTGACGATTCCGCTGGCTTTT 
TCAGTACGCATCGCTTGGCGCTCCATTACCCAGCCCAATGCGGCCTTGCCGTTGACCAAGCGATTCCAAGGACTGCGATC 
CATGTAGCCAGCCCTCTAATGCATGTATGTATAGGTAAGGTCGTCGTTATTICGGCGTGGATGTGCTGA 

>ORFI1738 (SEQ ID NO:92) 

GAAGAGGTGATCATGAAGTTACAGGCATATCGGCTGCAGAACTACCGCCGGCTGCGCGATGTTGTCATCGAGCTCGATGA 
CGAAATTTCTATCTTTGTCGGTGCCAACAACAGCGGGAAGACATCCGCCGTCCAAGGCCTGTACTCAATGCTTCGCGGCG 
AAGTGAAGAAGTTCGAGCTCTTTGACTTCAGTGCGGCGCTGTGGGCCGAGATCGATGCGGTCGGCAGGACGCCCCCTGGC 
GATGAGGATGCGCCCAAAAGGTTACCGTCCATACTCTTGGATCTCTGGTTCCGCGTCGGTGAAGACGACCTCGCCACTGC 
GATGTCGCTGCTGCCGAGCACTGAGTGGGACGGCAAGTGCGTCGGGATCCGGGTAGCGTTCGAGCCTCGGGATGCCCACG 
AGCTCGTCTGGAAGTTCCATGAACTACATGAGAAGGCCAAC^CGCAGCTGTCGCGCTTGCGGCCAAGCGCAAGGCCGCC 
GGGGAGCAAGCTGTGGAGGCGGGCGCGGAAGACGCGGCTGCGGTGGTGGCCGATGCCGGCGAGTACAAGCCTTGGCCAGA 
AAGCCTGACGAAGTACCTCACAAAGGAACTGAGCAAGGAATACACCTTCCGCTACTACGTGCTCGATGAGCGGGCTTTTG 
TCGGCTATCAGGCAAGGGAGGCCGACTACGAGCCGCTACCCCTAGGCAAGGAGCCGGGCGGTGCAGCCATTCTCAAGTCG 
CTGGTGAGGGTCGACTTCCTGCGCGCGCAGCGGCACCTCGATGACCCAGATGCCGGTAGCTCTGATCGCGCAGAGAGCTT 
GTCGCGGCGTCTGAGCAGGTTCTATCACCGCAACCTGGAGAAGCGTGGCGACGACCATGCGGCTCTCAAGGCGCTAGATA 
CCTCOaMGAGCTGAACTTCCACCTGAAGGA 

GGCGTCAACMTCCGGAGATCGTGATTCGGGCGGCCTTGGATCCGACCACTGTCTTGGGGCAAGACGCCAAGGTTCACTA 
CGTGATCCCGGGCGTAGCTTCCGCCCAACTGCCAGACAGCTACAATGGCCTGGGGTTCAAGAATCTGGTCTACATGGTGG 
TTGAGCTGCTCGACTTGCACGAGCAGTGGAAAGCCGAGGATGACAAGCGAGCTCCGCTTCATTTGGTCTTCATTGAGGAG 
CCTGAGGCGCATCTGCACGCGCAGATCCAGCAGGTCTTCATCAGGAACGTTTTGCGCCTCCTTGAGGATGCTAACGATCA 
CGCGACTTTGTTCCACACGCAGCTCGTCATCACCACGCACTCCCCGCACATCCTCTATGAACGCGGATTCTCGCCCATTC 
GGTACTTCCGCCGCGTCAACGACCAGTTGGGCCATCACACGGATGTGCGCAATCTGTCGCTATTCAAAACGGGCGCGTCC 
GACGCTCCAGCGCGCGAATTCCTGCAGCGGTATCTGAAGCTGACGCACTGCGATCTCTTTTTTTCCGACGCGGTGATATT 
GGTGGAAGGCAACGTCGAGCGTCTGCTCCTGCCTGCAATGATCGAGTTGGTGGCCAAGCGCCTGCGTTCTTCCGCCCTAA 
CCATCCTTGAAGTCGGTGGTGCGTTCGCGCATCGGTTCCAGGAGCTGATCGCCTTCGTTGGGCTCACAACACTGGTCATC 
ACGGATCTGGACAGCGTGACGGTCAAGACGGACGCCGAGAAGGCCGCCGCGCAAGGCGCAGGCGCTGAGGGCGCCGTTGA 
CGGAGATGACGAGGACGAGGACGACGACCTGAAGCCCTTCGAGCTTGAAGACGACGACGAAGCAGAACCGAGTGGCAAGA 
AGAAGTCCAAGAAGCGTGGCAGCACCTGCCATGCACACGTGGAAGGTGCCGTCACGTCCAACCAAACCCTCATCAGCTGG 
ATCCCGAAGAAGCGGTCGATGGCAGAGCTCTGGGAAGTCACGGCGGAGCAAAAGACGCTGTCGCTGGCTGAGGATTCCAG 
CGCTGGGGTTCGGGTAGCTTACCAGACCAAGGTTTCGGTGACGGTGGGTGCGACGACATCACAGCTCTGCGGCCGCACAC 
TTGAGGAGGCCTTTGGTCTTGAGAACGCGGACTGGTGCCAGGCTGAGGCAAACCGGTCGGTCGGCCTCAAGCTCAAGCGC 
GCACCGAGCAGCCCTGAAGAGCTGGCTGAGAAGTTACACGATAGGGTGGTCGGCAAGAACTTCGACAAGACCCGCTTTGC 
GCTGGAGGTACTCGCAAGCGGGCCGCTCAATGGCTGGAAGGTTCCCGCGTACATCGCCGAGGGCTTGGCCTGGCTCGAAG 
CCAAAGTGGCCCACGAGCTTGAGGCGGATGCTGCCATCGCCACCGAGGTCGCGACTATTGAGCCGACTACAGCCGATGTT 
GTCGCTATCATTGTTGACCCGGGGCAGACGGCATGA 
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>ORF12348c (SEQ ID NO:94) 

CG6AAGGTGTATTCCTTGCTCAGTTCCTTT6T6A6GTACTTCGTCAGGCTTTCTG6CCAA6GCTT6TACTC6CCG6CATC 
GGCCACCACCGCAGCCGCGTCTTCCGCGCCCGCCTCCACAGCTTGCTCCCCGGCGGCCTTGCGCTTGGCCGCAAGCGCGA 
CAGCTGCGTTGTTGGCCTTCTCATGTAGTTCATGGAACTTCCAGACGAGCTCGTGGGCATCCCGAGGCTCGAACGCTACC 
CGGATCCCGACGCACTTGCCGTCCCACTCAGTGCTCGGCAGCAGCGACATCGCAGTGGCGAGGTCGTCTTCACCGACGCG 
GAACCAGAGATCCAAGAGTATGGACGGTAACCTTTTGGGCGCATCCTCATCGCCAGGGGGCGTCCTGCCGACCGCATCGA 
TCTCGGCCCACAGCGCCGCACTGAAGTCAAAGAGCTCGAACTTCTTCACTTCGCCGCGAAGCATTGAGTACAGGCCTTGG 
ACGGCGGATGTCTTCCCGCTGTTGTTGGCACCGACAAAGATAGAAATTTCGTCATCGAGCTCGATGACAACATCGCGCAG 
CCGGCGGTAG 

>ORF12314c (SEQ ID NO:96) 

GGTACTTCGTCAGGCTTTCTGGCCAAGGCTTGTACTCGCCGGCATCGGCCACCACCGCAGCCGCGTCTTCCGCGCCCGCC 
TCCACAGCTTGCTCCCCGGCGGCCTTGCGCTTGGCCGCAAGCGCGACAGCTGCGTTGTTGGCCTTCTCATGTAGTTCATG 
GAACTTCCAGACGAGCTCGTGGGCATCCCGAGGCTCGAACGCTACCCGGATCCCGACGCACTTGCCGTCCCACTCAGTGC 
TCGGCAGCAGCGACATCGCAGTGGCGAGGTCGTCTTCACCGACGCGGAACCAGAGATCCAAGAGTATGGACGGTAACCTT 
TTGGGCGCATCCTCATCGCCAGGGGGCGTCCTGCCGACCGCATCGATCTCGGCCCACAGCGCCGCACTGA 

>ORF13156c (SEQ ID NO:98) 

CGACAGATTGCGCACATCCGTGTGATGGCCCAACTGGTCGTTGACGCGGCGGAAGTACCGAATGGGCGAGAATCCGCGTT 
CATAGAGGATGTGCGGGGAGTGCGTGGTGATGACGAGCTGCGTGTGGAACAAAGTCGCGTGATCGTTAGCATCCTCAAGG 
AGGCGCAAAACGTTCCTGATGAAGACCTGCTGGATCTGCGCGTGCAGATGCGCCTCAGGCTCCTCAATGAAGACCAAATG 
AAGCGGAGCTCGCTTGTCATCCTCGGCTTTCCACTGCTCGTGCAAGTCGAGCAGCTCAACCACCATGTAGACCAGATTCT 
TGAACCCCAGGCCATTGTAGCTGTCTGGCAGTTGGGCGGAAGCTACGCCCGGGATCACGTAGTGAACCTTGGCGTCTTGC 
CCCAAGACAGTGGTCGGATCCAAGGCCGCCCGAATCACGATCTCCGGATTGTTGACGCCCGGATAGCCGAGCTTGGCCAG 
GCGCGTGAGGGTGTCATTGAAGACTTCCTTCAGGTGGAAGTTCAGCTCCTTCTCCGAGGTATCTAG 

>ORF12795 (SEQ ID N0:100) 

CTTCCGCCCAACTGCCAGACAGCTACAATGGCCTGGGGTTCAAGAATCTGGTCTACATGGTGGTTGAGCTGCTCGACTTG 
CACGAGCAGTGGAAAGCCGAGGATGACAAGCGAGCTCCGCTTCATTTGGTCTTCATTGAGGAGCCTGAGGCGCATCTGCA 
CGCGCAGATCCAGCAGGTCTTCATCAGGAACGTTTTGCGCCTCCTTGAGGATGCTAACGATCACGCGACTTTGTTCCACA 
CGCAGCTCGTCATCACCACGCACTCCCCGCACATCCTCTATGAACGCGGATTCTCGCCCATTCGGTACTTCCGCCGCGTC 
AACGACCAGTTGGGCCATCACACGGATGTGCGCAATCTGTCGCTATTCAAAACGGGCGCGTCCGACGCTCCAGCGCGCGA 
ATTCCTGCAGCGGTATCTGA 

>ORF12314c (SEQ ID NO:96) 

GGTACTTCGTCAGGCTTTCTGGCCAAGGCTTGTACTCGCCGGCATCGGCCACCACCGCAGCCGCGTCTTCCGCGCCCGCC 
TCCACAGCTTGCTCCCCGGCGGCCTTGCGCTTGGCCGCAAGCGCGACAGCTGCGTTGTTGGCCTTCTCATGTAGTTCATG 
GAACTTCCAGACGAGCTCGTGGGCATCCCGAGGCTCGAACGCTACCCGGATCCCGACGCACTTGCCGTCCCACTCAGTGC 
TCGGCAGCAGCGACATCGCAGTGGCGAGGTCGTCTTCACCGACGCGGAACCAGAGATCCAAGAGTATGGACGGTAACCTT 
TTGGGCGCATCCTCATCGCCAGGGGGCGTCCTGCCGACCGCATCGATCTCGGCCCACAGCGCCGCACTGA 

>ORF13156c (SEQ ID NO:98) 

CGACAGATTGCGCACATCCGTGTGATGGCCCAACTGGTCGTTGACGCGGCGGAAGTACCGAATGGGCGAGAATCCGCGTT 
CATAGAGGATGTGCGGGGAGTGCGTGGTGATGACGAGCTGCGTGTGGAACAAAGTCGCGTGATCGTTAGCATCCTCAAGG 
AGGCGCAAAACGTTCCTGATGAAGACCTGCTGGATCTGCGCGTGCAGATGCGCCTCAGGCTCCTCAATGAAGACCAAATG 
AAGCGGAGCTCGCTTGTCATCCTCGGCTTTCCACTGCTCGTGCAAGTCGAGCAGCTCAACCACCATGTAGACCAGATTCT 
TGAACCCCAGGCCATTGTAGCTGTCTGGCAGTTGGGCGGAAGCTACGCCCGGGATCACGTAGTGAACCTTGGCGTCTTGC 
CCCAAGACAGTGGTCGGATCCAAGGCCGCCCGAATCACGATCTCCGGATTGTTGACGCCCGGATAGCCGAGCTTGGCCAG 
GCGCGTGAGGGTGTCATTGAAGACTTCCTTCAGGTGGAAGTTCAGCTCCTTCTCCGAGGTATCTAG 
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>ORF12795 (SEQ ID NO:100) 

CTTCCGCCCAACTGCCAGACA6CTACAATG6CCTGGGGTTCAAGAATCT6GTCTACATGSTGGTTGAGCTGCTCGACTT6 

CACGAGCAGTGGAAAGCCGAGGATGACAAGCGAGCTCCGCTTCATTTGGTCTTCATTGAGGAGCCTGAGGCGCATCTGCA 

CGCGCAGATCCAGCAGGTCTTCATCAGGAACGTTTTGCGCCTCCTTGAGGATGCTAACGATCACGCGACTTTGTTCCACA 

CGCAGCTCGTCATCACCACGCACTCCCCGCACATCCTCTATGAACGCGGATTCTCGCCCATTCGGTACTTCCGCCGCGTC 

AACGACCAGTTGGGCCATCACACGGATGTGCGCAATCTGTCGCTATTCAAAACGGGCGCGTCCGACGCTCCAGCGCGCGA 
ATTCCTGCAGCGGTATCTGA 

>ORF13755c (SEQ ID NO:210) 

GCTACCCGAACCCCAGCGCTGGAATCCTCAGCCAGCGACAGCGTCTTTTGCTCCGCCGTGACTTCCCAGAGCTCTGCCAT 
CGACCGCTTCTTCGGGATCCAGCTGATGAGGGTTTGGTTGGACGTGACGGCACCTTCCACGTGTGCATGGCAGGTGCTGC 
CACGCTTCTTGGACTTCTTCTTGCCACTCGGTTCTGCTTCGTCGTCGTCTTCAAGCTCGAAGGGCTTCAGGTCGTCGTCC 
TCGTCCTCGTCATCTCCGTCAACGGCGCCCTCAGCGCCTGCGCCTTGCGCGGCGGCCTTCTCGGCGTCCGTCTTGACCGT 
CACGCTGTCCAGATCCGTGATGACCAGTGTTGTGAGCCCAACGAAGGCGATCAGCTCCTGGAACCGATGCGCGAACGCAC 
CACCGACTTCAAGGATGGTTAGGGCGGAAGAACGCAGGCGCTTGGCCACCAACTCGATCATTGCAGGCAGGAGCAGACGC 
TCGACGTTGCCTTCCACCAATATCACCGCGTCGGAAAAAAAGAGATCGCAGTGCGTCAGCTTCAGATACCGCTGCAGGAA 
TTCGCGCGCTGGAGCGTCGGACGCGCCCGTTTTGAATAGCGACAGATTGCGCACATCCGTGTGA 

>ORF13795c (SEQ ID NO:212) 

TGTCGTCGCACCCACCGTCACCGAAACCTTGGTCTGGTAAGCTACCCGAACCCCAGCGCTGGAATCCTCAGCCAGCGACA 
GCGTCTTTTGCTCCGCCGTGACTTCCCAGAGCTCTGCCATCGACCGCTTCTTCGGGATCCAGCTGATGAGGGTTTGGTTG 
GACGTGACGGCACCTTCCACGTGTGCATGGCAGGTGCTGCCACGCTTCTTGGACTTCTTCTTGCCACTCGGTTCTGCTTC 
GTCGTCGTCTTCAAGCTCGAAGGGCTTCAGGTCGTCGTCCTCGTCCTCGTCATCTCCGTCAACGGCGCCCTCAGCGCCTG 
CGCCTTGCGCGGCGGCCTTCTCGGCGTCCGTCTTGACCGTCACGCTGTCCAGATCCGTGATGACCAGTGTTGTGAGCCCA 
ACGAAGGCGATCAGCTCCTGGAACCGATGCGCGAACGCACCACCGACTTCAAGGATGGTTAG 

>ORF14727c (SEQ ID NO:214) 

CAGGAAGTCGGCGAGCTGAAGGATGTCCTCGTGGCCAAGTATGCCCTTGGCGTAGTCACTGCCCACGCCGTAGTTGAACG 

TCCTGACGCCGGCCACAGCCTCCAGGCTTCGGACATATCGCTCTTGGTCGGCCTTGTTCCTGTCGCGCGTGGTCTGCCGG 

ACACGCGAGCTGTAATTCTCGAACTCTTCTTCAAGTTCGGAGATCCGCCTGCGGATGTCGTTCTGCAGCCAAACCTTGAT 

GTCGGCCTGGAACGTCTTTGCAATAGACCAGTAAAAGCTGTGGATGGTCGAGACATGAACCAGCGGGTCATCGTTGACGT 

CCGCCAGGATTTCATTGGTGGCAAGGTCGGTATACGTGATGCACGCGACTATCTGCTTCCTCGCCCGCATGCTGGCGCCG 

TGCTCCGAGATCACCCAGTCCAGCGCCTTGATGAGGGAGGTGGTCTTGCCGGAACCTGCGCCAGCACGAACCACGAAGGG 

CTGCGGAGGCGTCGCTACAATGCATGCGTGGATCTCGCGGTCGGCGTCGGTATCTGGGCTATCAATTCGTCTGCTCATGC 

CGTCTGCCCCGGGTCAACAATGATAGCGACAACATCGGCTGTAGTCGGCTCAATAGTCGCGACCTCGGTGGCGATGGCAG 

CATCCGCCTCMGCTCGTGGGCQCTTTGGCTTCGAGCCAGGCCAAGCCCTCGGCGATGTACGCGGGAACCTTCCAGCCA 

TTGAGCGGCCCGCTTGCGAGTACCTCCAGCGCAAAGCGGGTCTTGTCGAAGTTCTTGCCGACCACCCTATCGTGTAACTT 

CTCAGCCAGCTCTTCAGGGCTGCTCGGTGCGCGCTTGAGCTTGAGGCCGACCGACCGGTTTGCCTCAGCCTGGCACCAGT 

CCGCGTTCTCAAGACCAAAGGCCTCCTCAAGTGTGCGGCCGCAGAGCTGTGATGTCGTCGCACCCACCGTCACCGAAACC 
TTGGTCTGGTAA 

>ORF13779 {SEQ ID NO:216) 

CGGTGGGTGCGACGACATCACAGCTCTGCGGCCGCACACTTGAGGAGGCCTTTGGTCTTGAGAACGCGGACTGGTGCCAG 

GCTGAGGCAAACCGGTCGGTCGGCCTCAAGCTCAAGCGCGCACCGAGCAGCCCTGAAGAGCTGGCTGAGAAGTTACACGA 

TAGGGTGGTCGGCAAGAACTTCGACAAGACCCGCTTTGCGCTGGAGGTACTCGCAAGCGGGCCGCTCAATGGCTGGAAGG 

TTCCCGCGTACATCGCCGAGGGCTTGGCCTGGCTCGAAGCCAAAGTGGCCCACGAGCTTGAGGCGGATGCTGCCATCGCC 

ACCGAGGTCGCGACTATTGAGCCGACTACAGCCGATGTTGTCGCTATCATTGTTGACCCGGGGCAGACGGCATGAGCAGA 
CGAATTGA 
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>ORF14293c (SEQ ID NO:218) 

6G6A6GT6GTCTT6CCGGAACCTGCGCCA6CAC6AACCACGAA6GGCT6CG6A6GCGTC6CTACAAT6CAT6C6TG6ATC 
TCGCGGTCGGCGTCGGTATCTGGGCTATCAATTCGTCTGCTCATGCCGTCTGCCCCGGGTCAACAATGATAGCGACAACA 
TCGGCTGTAGTCGGCTC^TAGTCGCGACCTCGGTGGC^^^ 

GAGCCAGGCCAAGCCCTCGGCGATGTACGCGGGAACCTTCCAGCCATTGAGCGGCCCGCTTGCGAGTACCTCCAGCGCAA 
AGCGGGTCTTGTCGAAGTTCTTGCCGACCACCCTATCGTGTAA 

>ORF14155 (SEQ ID NO:220) 

CCCGGGGCAGACGGCATGAGCAGACGAATTGATAGCCCAGATACCGACGCCGACCGCGAGATCCACGCATGCATTGTAGC 
GACGCCTCCGCAGCCCTTCGTGGTTCGTGCTGGCGCAGGTTCCGGCAAGACCACCTCCCTCATCAAGGCGCTGGACTGGG 
T(»TCTCGGAG<»CGGC^^ 

GAAATCCTGGCGGACGTCAACGATGACCCGCTGGTTCATGTCTCGACCATCCACAGCTTTTACTGGTCTATTGCAAAGAC 
GTTCCAGGCCGACATCAAGGTTTGGCTGCAGAACGACATCCGCAGGCGGATCTCCGAACTTGAAGAAGAGTTCGAGAATT 
ACAGCTCGCGTGTCCGGCAGACCACGCGCGACAGGAACAAGGCCGACCAAGAGCGATATGTCCGAAGCCTGGAGGCTGTG 
GCCGGCGTCAGGACGTTCAACTACGGCGTGGGCAGTGACTACGCCAAGGGCATACTTGGCCACGAGGACATCCTTCAGCT 
CGCCGACTTCCTGCTACAAAACCGCCCGCTGTTCCGACGGGTCGTGGCGCTGAGCTACCCGTTCGTGTTTATCGATGAGA 
GTCAGGACACGTTCCCGGGTGTAGTGAAGTCTTTCAAGGAAGTGGAAGCCCAGATGCAGGGCAAGTTCTGCCTTGGTTTT 
TTCGGCGACCCGATGCAGTCGATCTTCATGAGAGGCGCAGGGGACATCCAGCTTGAGGATCATTGGCGGGCCATCACGAA 
GCCGGAGAACTTTCGCTGCGCCAAGCAGATCCTTGACGTCGCCAATGCCGTGCGCGCGCAGGGCGATGGCATGGAGCAAG 
TCCGCGGGCTGCACGAGAGGGTCGATGGGAACCTCAAGCTGGTGGAGGGGTCGGCCCGGATGTTCGTCTTGCCGAACACG 
CTGAACCGAACCGAGGCTTTGGCAAGAGTCCGAGCGTGGAGCTCGGCGACGAACAACGACGAGGGTTGGACAACCCCAGA 
CATCGCAGTCAAGATTCTTGTCATCGTGCACCGCATGGCCGCAAACCGGCTTGGCTTCGGCGGCATCTACTCGGCGCTGA 
ACGACAAGACGTCGGATGCCATGAAGCAAGGGATGCAGGACGGCACCGGTTGGCCCGnCGACCCTTCCTAAGTTTTGCG 
CTACCGATCGTTGCAGCTGTGAAGGCCGGCAATGAGTTCGCGGCGATGAGCCTGCTCCGGGAATTCAGCCCGCGCCTGGC 
GCCTGCGGCTCTGACCGGCCGACGTGCCGCGGATGTATTGCGAGAGCTGCACGCTGCTGCGTCGAGGCTTGTCGCCATGC 
TGGACGAGGCAGGGACCACCATTGGTGACATAGCTCTCCATCTCTGTGACACGGGTCTTTTTGAGTTCGACGAGCGCTAT 
GCGCGTGTTCTTGGGTTTGTCAGGGATATTGCTGACACCGCTCAGGAGCCCGAGGCTGCTGATGCAGTTCCGGCCGAAGG 
ATTATCCTTGGACGCGACAATGGCCAAGTTCTTCAATTGCTCTGCGCAAGAGCTTTGGCCCTATGAACGCTATGTCTCAG 
AAGGCTCCCCCTATGCCACGCAGCACGGCGTGAAGGGAGCGCAGTTCGAACGCGTCATGGTGGTGATGGACGAGGAAGAA 
AGCGACTACCGAACGTACAACTACGAGCGTGTCTTCGCGAGTGCTGAGGCCCGCGCTGCAGATCGTGCACGAGCACTAGA 
CGGTGATGAAAACACTTGGAGCCGAACGCTGCGACTGCTTTACGTCTGCTGCACTCGTGCCCAGCGGGGGCTGGTACTAG 
CGTTCTTTGTCGCCGACCCTGCGACCACCCTGGAAAACGTCGTGGCGAGCGGGATCTTGCCGCGAAGCGCAGTCTTTACG 
CAGGAAGTGTTAGTTGGATGGCCATAG 

>ORF14360 (SEQ ID NO:222) 

TCGCGTGCATCACGTATACCGACCTTGCCACCAATGAAATCCTGGCGGACGTCAACGATGACCCGCTGGTTCATGTCTCG 
ACCATCCACAGCTTTTACTGGTCTATTGCAAAGACGTTCCAGGCCGACATCAAGGTTTGGCTGCAGAACGACATCCGCAG 
GCGGATCTCCGAACTTGAAGAAGAGTTCGAGAATTACAGCTCGCGTGTCCGGCAGACCACGCGCGACAGGAACAAGGCCG 
ACCAAGAGCGATATGTCCGAAGCCTGGAGGCTGTGGCCGGCGTCAGGACGTTCAACTACGGCGTGGGCAGTGACTACGCC 
AAGGGCATACTTGGCCACGAGGACATCCTTCAGCTCGCCGACTTCCTGCTACAAAACCGCCCGCTGTTCCGACGGGTCGT 
GGCGCTGA 

>ORF15342c (SEQ ID NO:224) 

GAAGGGTCGAACGGGCCAACCGGTGCCGTCCTGCATCCCTTGCTTCATGGCATCCGACGTCTTGTCGTTCAGCGCCGAGT 
AGATGCCGCCGAAGCCAAGCCGGTTTGCGGCCATGCGGTGCACGATGACAAGAATCTTGACTGCGATGTCTGGGGTTGTC 
CAACCCTCGTCGTTGTTCGTCGCCGAGCTCCACGCTCGGACTCTTGCCAAAGCCTCGGTTCGGTTCAGCGTGTTCGGCAA 
GACGAACATCCGGGCCGACCCCTCCACCAGCTTGAGGTTCCCATCGACCCTCTCGTGCAGCCCGCGGACTTGCTCCATGC 
CATCGCCCTGCGCGCGCACGGCATTGGCGACGTCAAGGATCTGCTTGGCGCAGCGAAAGTTCTCCGGCTTCGTGATGGCC 
CGCCAATGATCCTCAAGCTGGATGTCCCCTGCGCCTCTCATGAAGATCGACTGCATCGGGTCGCCGAAAAAACCAAGGCA 
GAACTTGCCCTGCATCTGGGCTTCCACTTCCTTGAAAGACTTCACTACACCCGGGAACGTGTCCTGACTCTCATCGATAA 
ACACGAACGGGTAGCTCAGCGCCACGACCCGTCGGAACAGCGGGCGGTTTTGTAG 
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>ORF15260c (SEQ ID NO:226) 

ATGCC6CCGAAGCCAAGCC6GTTTGC6GCCATGC6GTGCACGATGACAA6AATCTTGACTGCGATGTCT6GGGTT6TCCA 

ACCCTCGTCGTTGTTCGTCGCCGAGCTCCACGCTCGGACTCTTGCCAAAGCCTCGGTTCGGTTCAGCGTGTTCGGCAAGA 

CGAACATCCGGGCCGACCCCTCCACCAGCTTGAGGTTCCCATCGACCCTCTCGTGCAGCCCGCGGACTTGCTCCATGCCA 

TCGCCCTGCGCGCGCACGGCATTGGCGACGTCAAGGATCTGCTTGGCGCAGCGAAAGTTCTCCGGCTTCGTGATGGCCCG 
CCAATGA 

>ORF14991 (SEQ ID NO:228) 

CGTCGCCAATGCCGTGC(X:GCGCAGGGCGATGGC^TG(3AGCAAGTCCGCGGGCTGCACGAGAGGGTCGATGGGAACCTCA 
AGCTGGTGGAGGGGTCGGCCCGGATGTTCGTCTTGCCGAACACGCTGAACCGAACCGAGGCTTTGGCAAGAGTCCGAGCG 
TGGAGCTCGGCGACGAACAACGACGAGGGTTGGACAACCCCAGACATCGCAGTCAAGATTCTTGTCATCGTGCACCGCAT 
GGCCGCAAACCGGCTTGGCTTCGGCGGCATCTACTCGGCGCTGAACGACAAGACGTCGGATGCCATGAAGCAAGGGATGC 
AGGACGGCACCGGTTGGCCCGTTCGACCCTTCCTAAGTTTTGCGCTACCGATCGTTGCAGCTGTGAAGGCCGGCAATGA 

>ORF15590c (SEQ ID NO:230) 

CGCTCGTCGAACTCAAAAAGACCCGTGTCACAGAGATGGAGAGCTATGTCACCAATGGTGGTCCCTGCCTCGTCCAGCAT 

GGCGACAAGCCTCGACGCAGCAGCGTGCAGCTCTCGCAATACATCCGCGGCACGTCGGCCGGTCAGAGCCGCAGGCGCCA 

GGCGCGGGCTGAATTCCCGGAGCAGGCTCATCGCCGCGAACTCATTGCCGGCCTTCACAGCTGCAACGATCGGTAGCGCA 

AAACTTAGGAAGGGTCGAACGGGCCAACCGGTGCCGTCCTGCATCCCTTGCTTCATGGCATCCGACGTCTTGTCGTTCAG 
CGCCGAGTAG 

>ORF15675c (SEQ ID NO:232) 

TCCTTCGGCCGGAACTGCATCAGCAGCCTCGGGCTCCTGAGCGGTGTCAGCAATATCCCTGACAAACCCAAGAACACGCG 
CATAGC(X!TCGTCGMCTCAAAAAGACCCGTGTCACAGAGATGGAGAGCTATGTCACCAATGGTGGTCCCTGCCTCGTCC 
AGCATGGCGACAAGCCTCGACGCAGCAGCGTGCAGCTCTCGCAATACATCCGCGGCACGTCGGCCGGTCAGAGCCGCAGG 
CGCCAGGCGCGGGCTGAATTCCCGGAGCAGGCTCATCGCCGCGAACTCATTGCCGGCCTTCACAGCTGCAACGATCGGTA 



>ORF16405 (SEQ ID NO:234) 

ATCGACTCTTTGAGGAAATGCGTGGGAAGCCTGGAAAAGTGCTGTTTCGCCTGCAAAGAAATAATTCATGTTCATGCGAT 

TCGTTGTCGGCAGTGCGGCGAGTCCCAAGGCTGGCGAAGGTTCATGAGCTCTCCAACCTCAGTAGTTGCGTTGGTCCTTA 
GCCTTTTATCAATCGCTGCQCAAMCCTGTGGAGCGATO 

GGTGGTGATTACAAAGCTGCCCAGCTTATGTTGACC^TAACGGGTCAAAGCCTGCAACTTTAGTTTCCTTCGAAATCAC 
ATCGAMGCCACGACCAATACGAAAACATGGTTTTTGGTAAGCAATACGGATGGCGAAATTCTGGAGCCAGGCAAAACTT 
ACAAAATCAGGGCCTCAACCGATGAGTCTATCCCAAAAATTGTCGAAGCTGAGCGTCGGACGATTTTGAAGTCTCAGTAC 
GCACTTGCAGATAATTGCGAATTAACCGCTAAATACATAGAGGCCACGGGGCAGAAGGTTGTGCGTGTGCAACCGTTCAT 
GTGCGACACACCTCCTGAAAAGGGTGGCCTGCCCCCTGGTAMCCTGGCATACCCATTTGGTACCTTGGTCAAGAATGA 

>ORF16925 (SEQ ID NO:236) 

AGGCCACGGGGCAGAAGGTTGTGCGTGTGCAACCGTTCATGTGCGACACACCTCCTGAAAAGGGTGGCCTGCCCCCTGGT 

AAACCTGGCATACCCATTTGGTACCTTGGTCAAGAATGATGTTTTTATGCCGCCCTGGGCTTTGACGCCGATTAAGCAAA 

GCTGTGTTCGCTCATCCMTACGTCCCTCGCCCAGTTAAACGACTGTTATGTATATGGGTGCTGCCGCTACGTAATACCT 

TGGCCCTACGCATACGAAGTTAATTCTGAAAGCGTTCAATGGACAATCTTCCTCCTCGGCGTCGACTGCAGCGGTAAGGT 

GATCTACTTTCGAAACACTGCAAGGGTAGGTCCTTTTTTGGCAGCGTCCATATACCGACCGTGGTATGGCTCAGATGCGC 
TGGTACTGCATTTCACCAAATAA 
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>ORF17793c (SEQ ID NO:238) 

6CCAAAATGATTGTCATTGACAAAMTCTAGAACATCTT6TTGCGCAATGC6CTATATGTGAAAAAACTTTATTTGACGA 
GTTTTCTCTCAAGATTCAATTGGGGCATACATATTACGAGCCAAAATCTTTGCCCGCCTCTGCAAGCATTGTATATGGGT 
CGCATCCAGCCCCGTCGACGTTTTTTTTGGAACCAAAAGAAATTCAGCAAAATTTGGTGCTGAAATCCGGTGAGCAAGTC 
ATCACCTGCAGTAMCATCGATACAAAATACCGTTAGATTATTnGGTCTGGTGCAMCCAAAGGAACCCTTGCGCGATT 
GTTCGTGCAGGTAACCTGTAATGACGGTCAGGTAGAGCCGGGGTTCGACGGGTACGTAACCCTTGAAATCGTCAATATGT 
CGCCTTGGACGATAGAAATACCGGCCGTGAGCGATATAGCACAACTTTATTTGGTGAAATGCAGTACCAGCGCATCTGAG 
CCATACCACGGTCGGTATATGGACGCTGCCAAAAAAGGACCTACCCTTGCAGTGTTTCGAAAGTAG 

>ORF18548c (SEQ ID NO:240) 

AGGACAATGGCAGGGTGGCCGCGTCTCGCAGCCCAAGGACGAAGGACAAATCTGATGAGTGTGTTACAGATCAAAGGGCG 

TACAACGAAATCCCACACGGATTTTGACGCGGCATCGTACTCCAGCAACAGCCTTATACTCACTGATGCAGGGGACGAGA 

GAATTGAAGAGTTTTCCCTCGAATTGTCCGTGGGTGAAGGGTGGAGTGATAACTATTCTGGCAACGACAAAAACCTGTGG 

CGCATTGTCGATGGTATGACGATCAGGGGTCACGATTCTGTTGTGGTGGAGGCCGCTGAAGAAATCAAGGTGCCGCACAA 

TCGGTACGGCATAGTCCTACCTACGGGAAGTCTTTTTCTCTCACGCGGCGTGCTGGTTGCTTCGGCGAAGGTCGAACCTG 

CATTTGATGGCMGCTCAAGCTCAGGATATTCAACACCACCMCAAAAATGTCTGCCTTACCAAAGGCGAGAAK 

TCTGTGATTTTTTTCTCCACAGAATCGACGCACACCCAAAGCCCCATCAAGCGTGGCAGTGAAATATCGACGCTTCCCAT 

CACGCGGCGCGCGCGATTGAAGAAGTGGTTTTCGCTCAATCCCACCATATGGGTCGGGTGGACGCTGAATTTAATCGGAA 

GTTCCCTGGTGTCTTCTCTTATAATGTACGCCGTCTATTACAAGGTTGTGCTGGAACACCAGTCGCAGCCTCCTCAGTCA 

CAACAAAACGCTCAGCCATCGCCGAACGAAGTTAAGCCAAAATGA 

>ORF17875 (SEQ ID NO:242) 

ACGGCGTACATTATAAGAGAAGACACCAGGGAACTTCCGATTAAATTCAGCGTCCACCCGACCCATATGGTGGGATTGAG 

CGAAAACCACTTCTTCAATCGCGCGCGCCGCGTGATGGGAAGCGTCGATATTTCACTGCCACGCTTGATGGGGCTTTGGG 

TGTGCGTCGATTCTGTGGAGAAAAAAATCACAGAGCCAAGCTTCTCGCCTTTGGTAAGGCAGACATTTTTGTTGGTGGTG 

TTGAATATCCTGAGCTTGAGCTTGCCATCAAATGCAGGTTCGACCTTCGCCGAAGCAACCAGCACGCCGCGTGAGAGAAA 

AAGACTTCCCGTAGGTAGGACTATGCCGTACCGATTGTGCGGCACCTTGATTTCTTCAGCGGCCTCCACCACAACAGAAT 
CGTGA 

>ORF18479 (SEQ ID NO:244) 

TCTGTAACACACTCATCAGATTTGTCCTTCGTCCTTGGGCTGCGAGACGCGGCCACCCTGCCATTGTCCTTTATACCGGC 
CGATATCCCCGGATACCGCCTGAAAGATGACGTGCGCAAAGCGTGCACCAATCTGAATTTCAAACGCCTCGCTGTGATTG 
TTGGTGAGCGCGAACGTCATCGGCCCTACATAACCTGGAGGCAGCACACTGGAACTGAACGTTATCCCGCTTCTGAACAG 
CGTGCTTCTCGGAAAAAAAAGCGCCGCCAGATCTTCCGGCAGATCGAATTCTTCCATGGTGCTCGCCAGATAAGTCTTGC 
CCGGTTCCATGACGAAGCAGTCATCCGGGTCTGCGAGCACGACCTCGCTGGCAGGGGTGCGTCGCGTAGATTCTCGCAAG 
CTTCCACCCCCTACTGTCAGGCGAGAGAGGCCTGCGAGTCTGAGGTCAAATCCAACGCCTTCCGGGGTGGTCMCTCACG 
GTGGGCAAGGTGCTTGATTAG 

>ORF19027c (SEQ ID NO:246) 

ATGATTTACTCACCGCACTCGCTCCTGAAACTGGTCCGGGATGGAAAACTAATCAAGCACCTTGCCCACCGTGAGTTGAC 
CACCCCGGAAGGCGTTGGATTTGACCTCAGACTCGCAGGCCTCTCTCGCCTGACAGTAGGGGGTGGAAGCTTGCGAGAAT 
CTACGCGACGCACCCCTGCCAGCGAGGTCGTGCTCGCAGACCCGGATGACTGCTTCGTCATGGAACCGGGCAAGACTTAT 
CTGGCGAGCACCATGGAAGAATTCGATCTGCCGGAAGATCTGGCGGCGCTTTTTTTTCCGAGAAGCACGCTGTTCAGAAG 
CGGGATAACGTTCAGTTCCAGTGTGCTGCCTCCAGGTTATGTAGGGCCGATGACGTTCGCGCTCACCAACAATCACAGCG 
AGGCGTTTGAAATTCAGATTGGTGCACGCTTTGCGCACGTCATCTTTCAGGCGGTATCCGGGGATATCGGCCGGTATAAA 
GGACAATGGCAGGGTGGCCGCGTCTCGCAGCCCAAGGACGAAGGACAAATCTGA 

>ORF19305 (SEQ ID NO:248) 

TGGCCGTTCTCTGCCTGTCGCCTCTTTGGCATGACTGGTCAAGTCGGATGCAAACGGTGGTCAGCACCAATGCAATTGGG 

TGGTCATGTGCGATGCAATTACGCAGTTGAGCCTGGCCCAGTTCCTCCCMGCAAAGCATAAGACCAAGATGGCACATTG 

CCAACAAAATACCCTTCCCCGCTACCGTTGTTTTATCGTTGTTGCCAGCCCTGATCTGGCGGAAAAGCCCGCTCCATGAA 

TCGTCATGGAGCCTCCCATGTTTCAACTCCTTTCCTGGATATCCAGGAAGCCGTCCCCCACCCCAACAACCAAAGCTGCC 
CCAGGGGGATTCATCCTTCCTCTGA 
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>ORF19519(SEQIDNO:250) 

TCTGGCGGAAAAGCCCGCTCCAT6AATCGTCAT6GA6CCTCCCATGTTTCAACTCCTTTCCTGGATATCCAGGAAGCCGT 

CCCCCACCCCAACAACCAAAGCTGCCCCAGGGGGATTCATCCTTCCTCTGAGCAGCATGGAACTGCTCGGCACGCCTCGC 

CGCCG(KAGCTACTGGAGMCATCTG(K^GCGCGCCTC(K:TATCCMGCAGCAATTC(»G^GATCTACCGGCGGCCACT 

GGCCAACTATGCCGAGCTGGTCCAGCAGCTCCCTGCTTCGGAAAATCATCACCATGCCCATCCAGGCGGGATGATCGATC 

ACOmtKAGATCGTGGCCTACG&CTCAAGG 

TCAGCCCAGGCTGA 

>ORF19544(SEQID NO:252) 

ATCGTCATGGAGCCTCCCATGTTTCMCTCCTTTCCTGGATATCCAGGAAGCCGTCCCC(^CCCCAACMCCAAAGCTGC 
CCCAGGGGGATTCATCCTTCCTCTGAGCAGCATGGAACTGCTCGGCACGCCTCGCCGCCGGCAGCTACTGGAGAACATCT 
GGCAGCGCGCCTCGCTATCCAAGCAGCAATTCGAGGAGATCTACCGGCGGCCACTGGCCAACTATGCCGAGCTGGTCCAG 
CAGCTCCCTGCTTCGGAAAATCATCACCATGCCCATCCAGGCGGGATGATCGATCACGGCCTGGAGATCGTGGCCTACGC 
ACTCAAGGTACGGCAGACCTACCTGCTCCCGATCGGCGCAGCGCCGGAGTCACAGTCAGCCCAGGCTGAAGCCTGGTCGG 
CCGCCGCGGCGTATGGCGCCCTGGCTCATGACATAGGCAAGATCGTCGTCGACCTGCAGGTTGAGCTACAGGACGGCAGC 
ACCTGGCACCCTTGGAACGGACCGATCAACCAGCCATACCGCTTCAAGTACGTGAAGTCCCGCGAATACCAGCTCCACGG 
CGCTGCCTCAGCACTTCTCATCCACCAACTGCTACCGCGCACTGCACTCGATTGGCTCAGTCGCTTTCCAGAGCTGTGGG 
CTCAATTGATCTACCTGTTCGCTGGGCAGTACGAGCACGCCGGGATCCTCGGCGAGATCATCGTGAAGGCAGACCAGGCC 
TCAGTTGCACAGGAGCTAGGAGGCAATCCGGATCGAGCTCTGGCTGCACCGAAGCAGTCGCTGCAGCGGCAGTTGGCAGA 
CGGCCTTCGCTTCTTGGTGAAGGACAAGTTCAAGTTGAATCMCCTAGCGGCCCGTCTGATGGATGGCTGACCCAGGACG 
CACTCTGGCTGGTGAGCAAGCCTGCTGCCGATCAACTGAGAGCCTACCTGCTGGCCCAGGGTATCGATGGGGTGCCCTCC 
TCTAACGCGCCGTTCTTCAGCATGCTCCAGGACCAAGCCGTCATCCAGACAAATGCCGAGGACAAGGCCATTTGGACGGC 
CACGGTAGACAACGGTGCTGGATGGAGAAACAAGTTCACGCTACTCAAGATTGCTCCAGCCTTGATCTGGACAGATGCTG 
CCGAGCGCCCCTCACCCTACAGCGGATCACTGGTCGTTGAAGATGGAACCGCCTCAACGGAAAAGCCGGAAACGACCTGT 
GAMTTCCCAACGGGCCGGCTGAACAGCAGCAAGCACCAGAAACGAAGATGATGCTCCATCAACCTGCGCCGAGCGnGC 
GAAACCGGCAAACGAGACGCAGGCGATTGCGAAACCCTCMCTGATGATCAAGAAGAAACAGACGATTTGTATGCACTTC 
TT(MTAATATCMTTCGCCACTAGAAGAGCTAGACACTAGCCACGACTC(K:CGGCTGCCTCTCCTACGAACACACGCGGG 
GAGGAGAACCTACAGCAGCCACTAGGGACCAAGGAGCCAACAGATTGCGCTCCTGAAGCAATTGAAGATGTATTTATGCC 
TAGCAGAAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAAATCTGGCATCGCGGCCCGTCGCCTGTTCATCAACG 
ACACCAAGGCTTTGGTGCATACCGTAGACGGGACCGCCATGCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAG 
CATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGACCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGG 
GCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACCAT(^GGTTTCTGGTCCTCG(^AGACGAAAGAGCTCAAGG 
CCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCC 
GAAGGAGGTGTGGAATGA 

>ORF20008 (SEQ ID NO.-254) 

GCTACAGGACGGCAGCACCTGGCACCCTTGGAACGGACCGATCAACCAGCCATACCGCTTCAAGTACGTGAAGTCCCGCG 
AATACCAGCTCCACGGCGCTGCCTCAGCACTTCTCATCCACCAACTGCTACCGCGCACTGCACTCGATTGGCTCAGTCGC 
TTTCCAGAGCTGTGGGCTCAATTGATCTACCTGTTCGCTGGGCAGTACGAGCACGCCGGGATCCTCGGCGAGATCATCGT 
GAAGGCAGACCAGGCCTCAGTTGCACAGGAGCTAGGAGGCAATCCGGATCGAGCTCTGGCTGCACCGAAGCAGTCGCTGC 
AGCGGCAGTTGGCAGACGGCCTTCGCTTCTTGGTGAAGGACAAGTTCAAGTTGAATCAACCTAG 

>ORF20623c (SEQ ID NO:256) 

CGTGAACTTGTTTCTCCATCCAGCACCGTTGTCTACCGTGGCCGTCCAAATGGCCTTGTCCTCGGCATTTGTCTGGATGA 
CGGCTTGGTCCTGGAGCATGCTGAAGAACGGCGCGTTAGAGGAGGGCACCCCATCGATACCCTGGGCCAGCAGGTAGGCT 
CTCAGTTGATCGGCAGCAGGCTTGCTCACCAGCCAGAGTGCGTCCTGGGTCAGCCATCCATCAGACGGGCCGCTAGGTTG 
ATTCAACTTGAACTTGTCCTTCACCAAGAAGCGAAGGCCGTCTGCCAACTGCCGCTGCAGCGACTGCTTCGGTGCAGCCA 
GAGCTCGATCCGGATTGCCTCCTAG 
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>ORF21210c (SEQ ID NO:258) 

CGCTTGAAAATTCCTGGC6TGACCA6CATG6CGGTCCC6TCTAC66TATGCACCAAAGCCTTG6T6TC6TTGATGAACA6 
GCGACGGGCCGCGATGCCAGATTTCATCCAACCAACGAATCCCTGTCCCAGATCAGTACTTCTGCTAGGCATAAATACAT 
CTTCAATTGCTTCAGGAGCGCAATCTGTTGGCTCCTTGGTCCCTAGTGGCTGCTGTAGGTTCTCCTCCCCGCGTGTGTTC 
GTAGGAGAGGCAGCCGGCGAGTCGTGGCTAGTGTCTAGCTCTTCTAGTGGCGAATTGATATTACCAAGAAGTGCATACAA 
ATCGTCTGTTTCTTCTTGA 

>ORF21493c (SEQ ID NO:260) 

GCTGCTGCGGCGTCATTCCACACCTCCTTCGGCATCGGTGATGACCGTGAGGCTTGGGTTGTCCAGAGGCTGCTCAGGGA 
ACAGCAATTTGGGATCCTGGAGCAGGTAGGCCTTGAGCTCTTTCGTCTTGCGAGGACCAGAAACCTTGATGGTCCAGATG 
TTCAGGTTTTTACTGGTCTTCCGATGAAGCCCCTGTTTTTCGAACGCGCGCTGCACCAGCTTCCAGCCGGTCGTCTCCTT 
GGCTTGGGCCAGTTTTTCAAGCACCGGATGCTCTTGGACATAGCGCTTGAAAATTCCTGGCGTGACCAGCATGGCGGTCC 
CGTCTACGGTATGCACCAAAGCCTTGGTGTCGTTGATGAACAGGCGACGGGCCGCGATGCCAGATTTCATCCAACCAACG 
AATCCCTGTCCCAGATCAGTACTTCTGCTAGGCATAAATACATCTTCAATTGCTTCAGGAGCGCAATCTGTTGGCTCCTT 
GGTCCCTAG 

>ORF21333 (SEQ ID NO:262) 

ACATCTGGACCATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTG 
TTCCCTGAGCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGACGCCGCAGCAG 
CTCACCGAGGAGTACATCTTCGCGCACGATCTCCGGGMGCCAGCGCGAAGATCTACCGCGCCGCGACCAAGGCGCTGCT 
CAAGCACTTCGGTCCTACGGCAACCGTACAGGACGTGGACCACCGGGCTGTCCTGGGATGGCGACGCAAGGTACTGGAAC 
AAGGCCTGTCCAAGCGGAGCTGGAACACGTACTCGAATCATCTGCGGACGATCTGGGGCTATGCCATCGAGCATGA 

>ORF22074c (SEQ ID NO:264) 

GTGAGGCACAAGCCCCTCCGTTATTGGCACTACGAACTCTTTGTGAGTCTTCTCTGTCTCGCCGCGGATGAGGATCAGTT 
GATTTTCCCAGTCGATGTCGCGCTTGCGGATGCACAACAGCGCATTCAACCGGATGCCGGTGAAGTAGAAGACCTCAAAC 
GTGCAAAGCCAGAACCAGGCGGGCGTGATCCGTGCGCGTTCGCCGGTGCAGCGCTCTGCGCCGTCCTGCATGTTGAGCCA 
ATTGCGGGCGAGCAGGATGGCTTCGGCGGCGACGGTTTTGCTTGCTCGCCTGGGGGGAATGACGGTGGTCTTTCTGAACG 
GGTTGACTTGGGAGTGCGTCACCAACTCATGCTCGATGGCATAGCCCCAGATCGTCCGCAGATGATTCGAGTACGTGTTC 
CAGCTCCGCTTGGACAGGCCTTGTTCCAGTACCTTGCGTCGCCATCCCAGGACAGCCCGGTGGTCCACGTCCTGTACGGT 
TGCCGTAGGACCGAAGTGCTTGAGCAGCGCCTTGGTCGCGGCGCGGTAGATCTTCGCGCTGGCTTCCCGGAGATCGTGCG 
CGAAGATGTACTCCTCGGTGAGCTGCTGCGGCGTCATTCCACACCTCCTTCGGCATCGGTGATGACCGTGAGGCTTGGGT 
TGTCCAGAGGCTGCTCAGGGAACAGCAATTTGGGATCCTGGAGCAGGTAG 

>ORF21421 (SEQ ID NO:266) 

GCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGACGCCGCAGCAGCTCACCGA 
GGAGTACATCTTCGCGCACGATCTCCGGGAAGCCAGCGCGAAGATCTACCGCGCCGCGACCAAGGCGCTGCTCAAGCACT 
TCGGTCCTACGGCAACCGTACAGGACGTGGACCACCGGGCTGTCCTGGGATGGCGACGCAAGGTACTGGAACAAGGCCTG 
TCCAAGCGGAGCTGGAACACGTACTCGAATCATCTGCGGACGATCTGGGGCTATGCCATCGAGCATGAGTTGGTGACGCA 
CTCCCAAGTCAACCCGTTCAGAAAGACCACCGTCATTCCCCCCAGGCGAGCAAGCAAAACCGTCGCCGCCGAAGCCATCC 
TGCTCGCCCGCAATTGGCTCAACATGCAGGACGGCGCAGAGCGCTGCACCGGCGAACGCGCACGGATCACGCCCGCCTGG 
TTCTGGCTTTGCACGTTTGAGGTCTTCTACTTCACCGGCATCCGGTTGAATGCGCTGTTGTGCATCCGCAAGCGCGACAT 
CGACTGGGAAAATCAACTGATCCTCATCCGCGGCGAGACAGAGAAGACTCACAAAGAGTTCGTAGTGCCAATAACGGAGG 
GGCTTGTGCCTCACCTATCGAGGCTCCTGCAAGAGGCCGATAGAGCCGGATTCGCCGATGACGACCAGTTGTTCAACGTC 
AACCGGTTCTCACCGCACTACAAGAGCAAGGTGATGAACTCCGACCAGGTCGAAGCCATGTACCGGAAGTTGACCGAGAA 
GGTTGGGGTGCGGATGACCCCGCACCGTTTCCGGCACACCCTGGCCACCGACTTGATGAAGGCACCCGAGCGGAACATTC 
ACCTCACGAAGTGCCTGCTCAACCACTCGAATATCCAGACCACGATGAGCTACATCGAGGCCGACTACGATCACATGCGT 
GCCGTGCTGCATGCTAGAAGCCTGGCCCAAGGCGCGCTGGAGAATGTCAGGAAGGTGGATTACAGCGGCTCCCCGCAAGC 
CTCTGCCAAACCGAAGCCATGCGGGCAACCTCTCGCTCGAGTGAGTGAAGCGCCGCCACCGGAGGCCAGGACAGAGCCTG 
CAGAACCAAGGGAGCACACGCCAGGGACAGGCATTCAGGGAGGTCCAACCGCGTGGGAAGCAGATGCGCTACCACAGCCA 
CCTGACACCTTCGAACCAAGCGTGCTGTTCACTCTGATGGCTCAAAACTTATCGAACCGTGCCGCCTCGGCATCCGCGGC 
TCCCGCTGCAACAAGCGGATCAGGCGGATGGGGATCTGCCGCCCGAAGCAATCTCGCCTAG 
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>ORF22074c (SEQ ID NO:264) 

6TGA66CACAAGCCCCTCCGTTATTGGCACTAC6AACTCTTTGT6A6TCTTCTCT6TCTC6CC6CGGATGAGGATCAGTT 
GATTTTCCCAGTCGATGTCGCGCTTGCGGATGCACAACAGCGCATTCAACCGGATGCCGGTGAAGTAGAAGACCTCAAAC 
GTGCAMGCCAGAACCAGGCGGGCGTGATCCGTGCGCGTTCGCCGGTGCAGCGCTCTGCGCCGTCCTGCATGTTGAGCCA 
ATTGCGGGCGAGCAGGATGGCTTCGGCGGCGACGGTTTTGCTTGCTCGCCTGGGGGGAATGACGGTGGTCTTTCTGAACG 
GGTTGACTTGGGAGTGCGTCACCAACTCATGCTCGATGGCATAGCCCCAGATCGTCCGCAGATGATTCGAGTACGTGTTC 
CAGCTCCGCTTGGACAGGCCTTGTTCCAGTACCTTGCGTCGCCATCCCAGGACAGCCCGGTGGTCCACGTCCTGTACGGT 
TGCCGTAGGACCGAAGTGCTTGAGCAGCGCCTTGGTCGCGGCGCGGTAGATCTTCGCGCTGGCTTCCCGGAGATCGTGCG 
CGAAGATGTACTCCTCGGTGAGCTGCTGCGGCGTCATTCCACACCTCCTTCGGCATCGGTGATGACCGTGAGGCTTGGGT 
TGTCCAGAGGCTGCTCAGGGAACAGCAATTTGGGATCCTGGAGCAGGTAG 

>ORF21421 (SEQIDNO:266) 

GCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGACGCCGCAGCAGCTCACCGA 
GGAGTACATCTTCGCGCACGATCTCCGGGAAGCCAGCGCGAAGATCTACCGCGCCGCGACCAAGGCGCTGCTCAAGCACT 
TCGGTCCTACGGCAACCGTACAGGACGTGGACCACCGGGCTGTCCTGGGATGGCGACGCAAGGTACTGGAACAAGGCCTG 
TCCAAGCGGAGCTGGAACACGTACTCGAATCATCTGCGGACGATCTGGGGCTATGCCATCGAGCATGAGTTGGTGACGCA 
CTCCCAAGTCAACCCGTTCAGAAAGACCACCGTCATTCCCCCCAGGCGAGCAAGCAAAACCGTCGCCGCCGAAGCCATCC 
TGCTCGCCCGCAATTGGCTCAACATGCAGGACGGCGCAGAGCGCTGCACCGGCGAACGCGCACGGATCACGCCCGCCTGG 
TTCTGGCTTTGCACGTTTGAGGTCTTCTACTTCACCGGCATCCGGTTGAATGCGCTGTTGTGCATCCGCAAGCGCGACAT 
CGACTGGGAAAATCAACTGATCCTCATCCGCGGCGAGACAGAGAAGACTCACAAAGAGTTCGTAGTGCCAATAACGGAGG 
GGCTTGTGCCTCACCTATCGAGGCTCCTGCAAGAGGCCGATAGAGCCGGATTCGCCGATGACGACCAGTTGTTCAACGTC 
AACCGGTTCTCACCGCACTACAAGAGCAAGGTGATGAACTCCGACCAGGTCGAAGCCATGTACCGGAAGTTGACCGAGAA 
GGTTGGGGTGCGGATGACCCCGCACCGTTTCCGGCACACCCTGGCCACCGACTTGATGAAGGCACCCGAGCGGAACATTC 
ACCTCACGAAGTGCCTGCTCAACCACTCGAATATCCAGACCACGATGAGCTACATCGAGGCCGACTACGATCACATGCGT 
GCCGTGCTGCATGCTAGAAGCCTGGCCCAAGGCGCGCTGGAGAATGTCAGGAAGGTGGATTACAGCGGCTCCCCGCAAGC 
CTCTGCCAAACCGAAGCCATGCGGGCAACCTCTCGCTCGAGTGAGTGAAGCGCCGCCACCGGAGGCCAGGACAGAGCCTG 
CAGAACCAAGGGAGCACACGCCAGGGACAGGCATTCAGGGAGGTCCAACCGCGTGGGAAGCAGATGCGCTACCACAGCCA 
CCTGACACCTTCGMCCMGCGTGCTGTTCACTCTGATGGCTCAAMCTTATCGAACCGTGCCGCCTCGGCATCCGCGGC 
TCCCGCTGCAACAAGCGGATCAGGCGGATGGGGATCTGCCGCCCGAAGCAATCTCGCCTAG 

>ORF22608c (SEQ ID NO:268) 

CGCATCTGCTTCCCACGCGGTTGGACCTCCCTGAATGCCTGTCCCTGGCGTGTGCTCCCTTGGTTCTGCAGGCTCTGTCC 
TGGCCTCCGGTGGCGGCGCTTCACTCACTCGAGCGAGAGGTTGCCCGCATGGCTTCGGTTTGGCAGAGGCTTGCGGGGAG 
CCGCTGTAATCCACCTTCCTGACATTCTCCAGCGCGCCTTGGGCCAGGCTTCTAGCATGCAGCACGGCACGCATGTGATC 
GTAGTCGGCCTCGATGTAGCTCATCGTGGTCTGGATATTCGAGTGGTTGAGCAGGCACTTCGTGAGGTGAATGTTCCGCT 
CGGGTGCCTTCATCAAGTCGGTGGCCAGGGTGTGCCGGAAACGGTGCGGGGTCATCCGCACCCCAACCTTCTCGGTCAAC 
TTCCGGTACATGGCTTCGACCTGGTCGGAGTTCATCACCTTGCTCTTGTAGTGCGGTGA 

>ORF22626 (SEQ ID NO:270) 

CACCTTCGAACCAAGCGTGCTGTTCACTCTGATGGCTCAAAACTTATCGAACCGTGCCGCCTCGGCATCCGCGGCTCCCG 
CTGCAACAAGCGGATCAGGCGGATGGGGATCTGCCGCCCGAAGCAATCTCGCCTAGCGATACCGGTACTGAGGGCCGGCT 
ACCGGACGAAAGGTAGCCGTGCCTTCCAGCAGATCGTTAGGCCTGTAGGAAAAATCTGGAATTACCGAGAGCGCCTGGAT 
TCCAGCGCCGGCATGCTGGCAGAGCCAGCGCAATTTCAAGGCCAATACCACAGTACCCTCTGTAATCGCTGA 
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>ORF23228 (SEQ ID NO:272) 

AGAGATTCGAACTCCC6ACATCCT6CTCCCAM6CAGGCGCGCTACCGGACTGCGCTATACCCCGATT66AATTTGGCTC 
CGCGACCTGGACTCGAACCAGGGACCCAATGATTAACAGTCATTTGCTCTACCGACTGAGCTATCGCGGMCGTCTTTCT 
TCCAACCCTGGACGCTTCCGGTGTTGCTGGATTCGCGTCTCAGAGGCGCGCCATTTTACGGATGCGCGCGGGCATGTCAA 
CCCTCTGATCCAAAAAGTTTTTCTTCTTTTTCCACGAGCGACAAAACGGCCCTTCCACTGCATGCGGCAGCGCTCTCGCG 
CCTACCGGACGCCCATGAAAAAGCCCCGCCGAAGCGGGGCTTTCCCTGTCCGCCCCCGAAGAGGTCAGGCGAAGACGATC 
TCGTCGCCTTCCACCTTCGCCGAGATACTGGCACCCGGCGCGAATTTGCCGGCCAGGATCAGTTGCGCCAGCGGGTTCTC 
GATCCAGCGCTGGATGGCCCGCTTCAGCGGGCGTGCGCCATAGACCGGGTCGAAGCCGACGGCAATCAGCTTGTCCAGCG 
CCTCCTGGCTCAGTTCCAGGCTCAGCTCGCGCTCGGCCAGGCGCTTGCGCAGGCGACCGAGCTGGATCTCGGCGATGCCG 
GCGATCTGCTCGCGAGCCAGCGGCTCGAACACCACCACTTCGTCGATCCGGTTGATGAATTCCGGACGGAAGTGCGCATT 
GACCGCGTCCATCACTGCGGCACGTTGCGCCTCGCGGTCGCCGGCCAGCTCCTGGATCTGCGCCGAACCGAGGTTGGAGG 
TCATCACCACCACGGTGTTGCGGAAGTCCACCGTACGCCCGTGACTGTCGGTCAGGCGTCCGTCCTCGAGCACCTGGAGG 
AGAATGTTGAATACATCCGGATGGGCCTTCTCCACCTCGTCCAGCAGCACCACCGAGTAGGGCTTGCGGCGGATCGCCTC 
GGTCAGGTAGCCGCCTTCCTCGAAGCCGACGTAGCCCGGAGGCGCGCCGATCAGGCGGGCCACCGAGTGTTTCTCCATGA 
ACTC(MACATATCTATCC(K:ACCAGCGCCTCCTC(MTATC(3^GAGGAACTCGGCCAGCGCCTTGCACAACTCGGTCTTG 
CCCACCCCGGTCGGGCCGAGGAAGAGGAACGAGCCGCTCGGCCGGTTCGGATCGGCGAGGCCGGCGCGCGAACGGCGCAC 
GGCGTTGGACACGGCGACTACCGCCTCGTCCTGGCCGATCACTCGCCGATGCAGCTCCTGCTCCATGCGCAGCAGCTTCT 
CGCGCTCGCCCTCGAGCATCTTCGACACCGGGATACCGGTCCACTTGGAAACCACTTCGGCGATTTCCTCGTCGGTCACC 
TTGTTGCGCAGCAACTGGTTCTCGGTCTTGCCGTGCTGGTCGACCATCTGCAGGCTGCGTTCCAGGTCCGGGATGGTCTG 
GTACTGGATGCGCGCCATGCTCTCGAGGTCGCCCTTGCGCCGCGCCGCCTCCATCTCCTGCTTGGCCTGCTCGATCTTCT 
GCTGGATCTGCGCCGAGCCCTGCACCTCGGCCTTCTCGGACTTCCAGATCTCCTCGAGGTCGGCGTATTCGCGCTCGAGC 
TTGACGATATCCTCCTCCAGCTTGGCCAGGCGCTTCCTGGTGGCTTCGTCGTCTTCCTTCTTCAGCGCCTCGCGCTCGAT 
CTTCAGCTGGATCAGGCGACGGTCGAGACGATCCAGTTCCTCCGGCTTGGAGTCGATCTCCATGCGGATGCGGCTGGCGG 
CCTCGTCGATCAGGTCGATGGCCTTGTCCGGCAGTTGCCGATCGGTGATGTAGCGGTGCGACAGCTTGGCCGCGGCGATG 
ATCGCGCCGTCGGTGATGCTCACCCCGTGGTGCACTTCATAGCGTTCCTTGAGGCCACGGAGGATGGCGATGGTGTCTTC 
CTCGCTCGGTTCGTCCACCAGCACCTTCTGGAAGCGGCGCTCCAGCGCGGCATCCTTCTCGATGTACTGGCGATACTCGT 
CGAGGGTAGTAGCACCGACGCAGTGCAGCTCGCCGCGCGCCAGAGCCGGCTTGAGCATGTTGCCGGCGTCCATGGCACCT 
TCCGCCTTGCCGGCGCCGACCATGGTGTGCAGTTCGTCGATGAACAGGATGACCCGGCCTTCCTGCTTGCCCAGTTCGTT 
GAGGACCGCCTTCAGGCGTTCCTCGAACTCGCCGCGGAACTTGGCACCGGCGATCAGCGCCCCCATGTCCAGGGCCAGCA 
GGCGCTTGTCCTTGAGGCCGTCCGGCACTTCGCCGTTGATGATGCGCTGGGCCAGGCCCTCGACGATGGCGGTCTTGCCG 
ACGCCGGGTTCGCCGATCAGCACCGGGTTGTTCTTGGTCCGCCGCTGCAGGACCTGGATGGTCCGGCGGATCTCGTCGTC 
GCGACCGATCACCGGGTCGAGCTTGCCTTCCTCGGCGCGCTTGGTCATGTCGACGGTGTACTTGTCCAGCGCCTGGCGCG 
ACTCCTCGACGTTCGGGTCGTTCACCGCTTCGCCGCCACGCAGGTTGGCCACGGCATTCTCCAGCGCCTTGCGCGACACG 
CCCTGGCCGAGCAGCAGCTTGCCGAGCCTGGTGTTCTCGTCCATCGCGGCCAGCAATACCAGCTCGCTGGAGATGAACTG 
GTCGCCCTTCTGCTGGGCCAGGCGGTCAGCCTGGTTGAGCAGGCGTGCGAGATCCTGGGACAGGTTCACGTCGCCGGTCG 
GGCTCTGGATCTTCGGCAGCGCGTCGAGTTCTTTGTTGAGGCCGCTGCGCAGGGCGGCGATATCGAAGCCGACCTGCATC 
AGCAGGGGCTTGATCGAACCGCCTTGCTGCTCGAGCAGGGCGGAAAGCAGGTGCACCGGCTCGATGGCCGGATGGTCATG 
GCCAACGGCCAGGGACTGGGCGTCGGAGAGCGCCAGTTGCAGCTTGCTGGTCAAACGGTCTATTCGCATGGGTCGTCCTT 
CCTTCTATAG 

>ORF23367 (SEQ ID NO:274) 

GCTATCGCGGAACGTCTTTCTTCCAACCCTGGACGCTTCCGGTGTTGCTGGATTCGCGTCTCAGAGGCGCGCCATTTTAC 
GGATGCGCGCGGGCATGTCAACCCTCTGATCCAAAAAGTTTTTCTTCTTTTTCCACGAGCGACAAAACGGCCCTTCCACT 
GCATGCGGCAGCGCTCTCGCGCCTACCGGACGCCCATGAAAAAGCCCCGCCGAAGCGGGGCTTTCCCTGTCCGCCCCCGA 
AGAGGTCAGGCGAAGACGATCTCGTCGCCTTCCACCTTCGCCGAGATACTGGCACCCGGCGCGAATTTGCCGGCCAGGAT 
CAGTTGCGCCAGCGGGTTCTCGATCCAGCGCTGGATGGCCCGCTTCAGCGGGCGTGCGCCATAG 
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>ORF25103c (SEQ ID NO:276) 

A6TGCACCAC6G66T6A6CATCACCGAC6GC6CGATCATCGCC6C66CCAAGCT6TCGCACCGCTACATCACC6ATCGGC 
AACTGCCGGACAAGGCCATCGACCTGATCGACGAGGCCGCCAGCCGCATCCGCATGGAGATCGACTCCAAGCCGGAGGAA 
CTGGATCGTCTCGACCGTCGCCTGATCCAGCTGAAGATCGAGCGCGAGGCGCTGAAGAAGGAAGACGACGAAGCCACCAG 
GAAGCGCCTGGCCAAGCTGGAGGAGGATATCGTCAAGCTCGAGCGCGAATACGCCGACCTCGAGGAGATCTGGAAGTCCG 
AGAAGGCCGAGGTGCAGGGCTCGGCGCAGATCCAGCAGAAGATCGAGCAGGCCAAGCAGGAGATGGAGGCGGCGCGGCGC 
AAGGGCGACCTCGAGAGCATGGCGCGCATCCAGTACCAGACCATCCCGGACCTGGAACGCAGCCTGCAGATGGTCGACCA 
GCACGGCAAGACCGAGAACCAGTTGCTGCGCAACAAGGTGACCGACGAGGAAATCGCCGAAGTGGTTTCCAAGTGGACCG 
GTATCCCGGTGTCGAAGATGCTCGAGGGCGAGCGCGAGAAGCTGCTGCGCATGGAGCAGGAGCTGCATCGGCGAGTGATC 
GGCCAGGACGAGGCGGTAGTCGCCGTGTCCAACGCCGTGCGCCGTTCGCGCGCCGGCCTCGCCGATCCGAACCGGCCGAG 
CGGCTCGTTCCTCTTCCTCGGCCCGACCGGGGTGGGCAAGACCGAGTTGTGCAAGGCGCTGGCCGAGTTCCTCTTCGATA 
CCGAGGAGGCGCTGGTGCGGATAGATATGTCCGAGTTCATGGAGAAACACTCGGTGGCCCGCCTGATCGGCGCGCCTCCG 
GGCTACGTCGGCTTCGAGGAAGGCGGCTACCTGACCGAGGCGATCCGCCGCAAGCCCTACTCGGTGGTGCTGCTGGACGA 
GGTGGAGAAGGCCCATCCGGATGTATTCAACATTCTCCTCCAGGTGCTCGAGGACGGACGCCTGACCGACAGTCACGGGC 
GTACGGTGGACTTCCGCAACACCGTGGTGGTGATGACCTCCAACCTCGGTTCGGCGCAGATCCAGGAGCTGGCCGGCGAC 
CGCGAGGCGCAACGTGCCGCAGTGATGGACGCGGTCAATGCGCACTTCCGTCCGGAATTCATCAACCGGATCGACGAAGT 
GGTGGTGTTCGAGCCGCTGGCTCGCGAGCAGATCGCCGGCATCGCCGAGATCCAGCTCGGTCGCCTGCGCAAGCGCCTGG 
CCGAGCGCGAGCTGAGCCTGGAACTGAGCCAGGAGGCGCTGGACAAGCTGATTGCCGTCGGCTTCGACCCGGTCTATGGC 
GCACGCCCGCTGAAGCGGGCCATCCAGCGCTGGATCGAGAACCCGCTGGCGCAACTGATCCTGGCCGGCAAATTCGCGCC 
GGGTGCCAGTATCTCGGCGAAGGTGGAAGGCGACGAGATCGTCTTCGCCTGACCTCTTCGGGGGCGGACAGGGAAAGCCC 
CGCTTCGGCGGGGCTTTTTCATGGGCGTCCGGTAGGCGCGAGAGCGCTGCCGCATGCAGTGGAAGGGCCGTTTTGTCGCT 
CGTGGAAAAAGAAGAAAAACTTTTTGGATCAGAGGGTTGACATGCCCGCGCGCATCCGTAAAATGGCGCGCCTCTGA 

>ORF23556 (SEQ ID NO:278) 

AAAAGCCCCGCCGAAGCGGGGCTTTCCCTGTCCGCCCCCGAAGAGGTCAGGCGAAGACGATCTCGTCGCCTTCCACCTTC 
GCCGAGATACTGGCACCCGGCGCGAATTTGCCGGCCAGGATCAGTTGCGCCAGCGGGTTCTCGATCCAGCGCTGGATGGC 
CCGCTTCAGCGGGCGTGCGCCATAGACCGGGTCGAAGCCGACGGCAATCAGCTTGTCCAGCGCCTCCTGGCTCAGTTCCA 
GGCTCAGCTCGCGCTCGGCCAGGCGCTTGCGCAGGCGACCGAGCTGGATCTCGGCGATGCCGGCGATCTGCTCGCGAGCC 
AGCGGCTCGAACACCACCACTTCGTCGATCCGGTTGA 
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>ORF26191c ISEQ ID NO:280) 

AA6GAA66AC6ACCCATGC6AATAGACCGTTT6ACCAGCAAGCTGCAACTGSCGCTCTCC6ACGCCCA6TCCCTG6CCGT 
TGGCCATGACCATCCGGCCATCGAGCCGGTGCACCTGCTTTCCGCCCTGCTCGAGCAGCAAGGCGGTTCGATCAAGCCCC 
TGCTGATGCAGGTCGGCTTCGATATCGCCGCCCTGCGCAGCGGCCTCAACAAAGAACTCGACGCGCTGCCGAAGATCCAG 
AGCCCGACCGGCGACGTGAACCTGTCCCAGGATCTCGCACGCCTGCTCAACCAGGCTGACCGCCTGGCCCAGCAGAAGGG 
CGACCAGTTCATCTCCAGCGAGCTGGTATTGCTGGCCGCGATGGACGAGAACACCAGGCTCGGCAAGCTGCTGCTCGGCC 
AGGGCGTGTCGCGCAAGGCGCTGGAGAATGCCGTGGCCAACCTGCGTGGCGGCGAAGCGGTGAACGACCCGAACGTCGAG 
GAGTCGCGCCAGGCGCTGGACAAGTACACCGTCGAQTGACCAAGCKKCtSAGGAAGGCAAGCTCGACCCGGTGATCCM 
TCGCGACGACGAGATCCGCCGGACCATCCAGGTCCTGCAGCGGCGGACCAAGAACAACCCGGTGCTGATCGGCGAACCCG 
GCGTCGGCAAGACCGCCATCGTCGAGGGCCTGGCCCAGCGCATCATCAACGGCGAAGTGCCGGACGGCCTCAAGGACAAG 
CGCCTGCTGGCCCTGGACATGGGGGCGCTGATCGCCGGTGCCAAGTTCCGCGGCGAGTTCGAGGAACGCCTGAAGGCGGT 
CCTCAACGAACTGGGCAAGCAGGAAGGCCGGGTCATCCTGTTCATCGACGAACTGCACACCATGGTCGGCGCCGGCAAGG 
CGGAAGGTGCCATGGACGCCGGCAACATGCTCAAGCCGGCTCTGGCGCGCGGCGAGCTGCACTGCGTCGGTGCTACTACC 
CTCGACGAGTATCGCCAGTACATCGAGAAGGATGCCGCGCTGGAGCGCCGCTTCCAGAAGGTGCTGGTGGACGAACCGAG 
CGAGGAAGACACCATCGCCATCCTCCGTGGCCTCAAGGAACGCTATGAAGTGCACCACGGGGTGAGCATCACCGACGGCG 
CGATCATCGCCGCGGCCAAGCTGTCGCACCGCTACATCACCGATCGGCAACTGCCGGACAAGGCCATCGACCTGATCGAC 
GAGGCCGCCAGCCGCATCCGCATGGAGATCGACTCCAAGCCGGAGGAACTGGATCGTCTCGACCGTCGCCTGATCCAGCT 
GAAGATCGAGCGCGAGGCGCTGAAGAAGGAAGACGACGAAGCCACCAGGAAGCGCCTGGCCAAGCTGGAGGAGGATATCG 
TCAAOTCGAGC(X:GMTACGCC(»CCTCGAGGAGATCTGGAAGTCCGAGAAGGCCGAGGTGCAGGGCTCGGCGCAGATC 
CAGCAGAAGATCGAGCAGGCCAAGCAGGAGATGGAGGCGGCGCGGCGCAAGGGCGACCTCGAGAGCATGGCGCGCATCCA 
GTACCAGACCATCCCGGACCTGGAACGCAGCCTGCAGATGGTCGACCAGCACGGCAAGACCGAGAACCAGTTGCTGCGCA 
ACAAGGTGACCGACGAGGAAATCGCCGAAGTGGTTTCCAAGTGGACCGGTATCCCGGTGTCGAAGATGCTCGAGGGCGAG 
CGCGAGAAGCTGCTGCGCATGGAGCAGGAGCTGCATCGGCGAGTGATCGGCCAGGACGAGGCGGTAGTCGCCGTGTCCAA 
CGCCGTGCGCCGTTCGCGCGCCGGCCTCGCCGATCCGAACCGGCCGAGCGGCTCGTTCCTCTTCCTCGGCCCGACCGGGG 
TGGGCAAGACCGAGTTGTGCAAGGCGCTGGCCGAGTTCCTCTTCGATACCGAGGAGGCGCTGGTGCGGATAGATATGTCC 
GAGTTCATGGAGAAACACTCGGTGGCCCGCCTGATCGGCGCGCCTCCGGGCTACGTCGGCTTCGAGGAAGGCGGCTACCT 
GACCGAGGCGATCCGCCGCAAGCCCTACTCGGTGGTGCTGCTGGACGAGGTGGAGAAGGCCCATCCGGATGTATTCAACA 
TTCTCCTCCAGGTGCTCGAGGACGGACGCCTGACCGACAGTCACGGGCGTACGGTGGACTTCCGCAACACCGTGGTGGTG 
ATGACCTCCAACCTCGGTTCGGCGCAGATCCAGGAGCTGGCCGGCGACCGCGAGGCGCAACGTGCCGCAGTGATGGACGC 
GGTCAATGCGCACTTCCGTCCGGAATTCATCAACCGGATCGACGAAGTGGTGGTGTTCGAGCCGCTGGCTCGCGAGCAGA 
TCGCCGGCATCGCCGAGATCCAGCTCGGTCGCCTGCGCAAGCGCCTGGCCGAGCGCGAGCTGAGCCTGGAACTGAGCCAG 
GAGGCGCTGGACAAGCTGATTGCCGTCGGCTTCGACCCGGTCTATGGCGCACGCCCGCTGAAGCGGGCCATCCAGCGCTG 
GATCGAGAACCCGCTGGCGCAACTGATCCTGGCCGGCAAATTCGCGCCGGGTGCCAGTATCTCGGCGAAGGTGGAAGGCG 
ACGAGATCGTCTTCGCCTGA 

>ORF23751 (SEQ ID NO:282) 

ACCGGGTCGAAGCCGACGGCAATCAGCTTGTCCAGCGCCTCCTGGCTCAGTTCCAGGCTCAGCTCGCGCTCGGCCAGGCG 
CTTGCGCAGGCGACCGAGCTGGATCTCGGCGATGCCGGCGATCTGCTCGCGAGCCAGCGGCTCGAACACCACCACTTCGT 
CGATCCGGTTGATGAATTCCGGACGGAAGTGCGCATTGACCGCGTCCATCACTGCGGCACGTTGCGCCTCGCGGTCGCCG 
GCCAGCTCCTGGATCTGCGCCGAACCGAGGTTGGAGGTCATCACCACCACGGTGTTGCGGAAGTCCACCGTACGCCCGTG 
A 

>ORF24222 (SEQ ID NO:284) 

CCCGGAGGCGCGCCGATCAGGCGGGCCACCGAGTGTTTCTCCATGAACTCGGACATATCTATCCGCACCAGCGCCTCCTC 
GGTATCGAAGAGGAACTCGGCCAGCGCCTTGCACAACTCGGTCTTGCCCACCCCGGTCGGGCCGAGGAAGAGGAACGAGC 
CGCTCGGCCGGTTCGGATCGGCGAGGCCGGCGCGCGAACGGCGCACGGCGTTGGACACGGCGACTACCGCCTCGTCCTGG 
CCGATCACTCGCCGATGCAGCTCCTGCTCCATGCGCAGCAGCTTCTCGCGCTCGCCCTCGAGCATCTTCGACACCGGGAT 
ACCGGTCCACTTGGAAACCACTTCGGCGATTTCCTCGTCGGTCACCTTGTTGCGCAGCAACTGGTTCTCGGTCTTGCCGT 
GCTGGTCGACCATCTGCAGGCTGCGTTCCAGGTCCGGGATGGTCTGGTACTGGATGCGCGCCATGCTCTCGAGGTCGCCC 
TTGCGCCGCGCCGCCTCCATCTCCTGCTTGGCCTGCTCGATCTTCTGCTGGATCTGCGCCGAGCCCTGCACCTCGGCCTT 
CTCGGACTTCCAGATCTCCTCGAGGTCGGCGTATTCGCGCTCGAGCTTGACGATATCCTCCTCCAGCTTGGCCAGGCGCT 
TCCTGGTGGCTTCGTCGTCTTCCTTCTTCAGCGCCTCGCGCTCGATCTTCAGCTGGATCAGGCGACGGTCGAGACGATCC 
AGTTCCTCCGGCTTGGAGTCGATCTCCATGCGGATGCGGCTGGCGGCCTCGTCGATCAGGTCGATGGCCTTGTCCGGCAG 
TTGCCGATCGGTGATGTAG 
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>ORF24368 (SEQ ID NO:286) 

ACTCG6ACATATCTATCCGCACCAGCGCCTCCTCGGTATCGAAGAGGAACTCGGCCA6CGCCTTGCACAACTCGGTCTTG 
CCCACCCCGGTCGGGCCGAGGAAGAGGAACGAGCCGCTCGGCCGGTTCGGATCGGCGAGGCCGGCGCGCGAACGGCGCAC 
GGCGTTGGACACGGCGACTACCGCCTCGTCCTGGCCGATCACTCGCCGATGCAGCTCCTGCTCCATGCGCAGCAGCTTCT 
CGCGCTCGCCCTCGAGCATCTTCGACACCGGGATACCGGTCCACTTGGAAACCACTTCGGCGATTTCCTCGTCGGTCACC 
TTGTTGCGCAGCAACTGGTTCTCGGTCTTGCCGTGCTGGTCGACCATCTGCAGGCTGCGTTCCAGGTCCGGGATGGTCTG 
GTACTGGATGCGCGCCATGCTCTCGAGGTCGCCCTTGCGCCGCGCCGCCTCCATCTCCTGCTTGGCCTGCTCGATCTTCT 
GCTGGATCTGCGCCGAGCCCTGCACCTCGGCCTTCTCGGACTTCCAGATCTCCTCGAGGTCGGCGTATTCGCGCTCGAGC 
TTGA 

>ORF24888c (SEQ ID NO:288) 

AGAAGGAAGACGACGAAGCCACCAGGAAGCGCCTGGCCAAGCTGGAGGAGGATATCGTCAAGCTCGAGCGCGAATACGCC 
GACCTCGAGGAGATCTGGAAGTCCGAGAAGGCCGAGGTGCAGGGCTCGGCGCAGATCCAGCAGAAGATCGAGCAGGCCAA 
GCAGGAGATGGAGGCGGCGCGGCGCAAGGGCGACCTCGAGAGCATGGCGCGCATCCAGTACCAGACCATCCCGGACCTGG 
AACGCAGCCTGCAGATGGTCGACCAGCACGGCAAGACCGAGAACCAGTTGCTGCGCAACAAGGTGA 

>ORF25398c (SEQ ID NO:290) 

AGGCGGTCCTCAACGAACTGGGCAAGCAGGAAGGCCGGGTCATCCTGTTCATCGACGAACTGCACACCATGGTCGGCGCC 
GGCAAGGCGGAAGGTGCCATGGACGCCGGCAACATGCTCAAGCCGGCTCTGGCGCGCGGCGAGCTGCACTGCGTCGGTGC 
TACTACCCTCGACGAGTATCGCCAGTACATCGAGAAGGATGCCGCGCTGGAGCGCCGCTTCCAGAAGGTGCTGGTGGACG 
AACCGAGCGAGGAAGACACCATCGCCATCCTCCGTGGCCTCAAGGAACGCTATGAAGTGCACCACGGGGTGA 

>ORF25892c (SEQ ID NO:292) 

CCGCCTGGCCCAGCAGAAGGGCGACCAGTTCATCTCCAGCGAGCTGGTATTGCTGGCCGCGATGGACGAGAACACCAGGC 
TCGGCAAGCTGCTGCTCGGCCAGGGCGTGTCGCGCAAGGCGCTGGAGAATGCCGTGGCCAACCTGCGTGGCGGCGAAGCG 
GTGAACGACCCGAACGTCGAGGAGTCGCGCCAGGCGCTGGACAAGTACACCGTCGACATGACCAAGCGCGCCGAGGAAGG 
CAAGCTCGACCCGGTGATCGGTCGCGACGACGAGATCCGCCGGACCATCCAGGTCCTGCAGCGGCGGACCAAGAACAACC 
CGGTGCTGATCGGCGAACCCGGCGTCGGCAAGACCGCCATCGTCGAGGGCCTGGCCCAGCGCATCATCAACGGCGAAGTG 
CCGGACGGCCTCAAGGACAAGCGCCTGCTGGCCCTGGACATGGGGGCGCTGATCGCCGGTGCCAAGTTCCGCGGCGAGTT 
CGAGGAACGCCTGAAGGCGGTCCTCAACGAACTGGGCAAGCAGGAAGGCCGGGTCATCCTGTTCATCGACGAACTGCACA 
CCATGGTCGGCGCCGGCAAGGCGGAAGGTGCCATGGACGCCGGCAACATGCTCAAGCCGGCTCTGGCGCGCGGCGAGCTG 
CACTGCGTCGGTGCTACTACCCTCGACGAGTATCGCCAGTACATCGAGAAGGATGCCGCGCTGGAGCGCCGCTTCCAGAA 
GGTGCTGGTGGACGAACCGAGCGAGGAAGACACCATCGCCATCCTCCGTGGCCTCAAGGAACGCTATGA 

>ORF25110 (SEQ ID NO:294) 

CGTTCCTTGAGGCCACGGAGGATGGCGATGGTGTCTTCCTCGCTCGGTTCGTCCACCAGCACCTTCTGGAAGCGGCGCTC 
CAGCGCGGCATCCTTCTCGATGTACTGGCGATACTCGTCGAGGGTAGTAGCACCGACGCAGTGCAGCTCGCCGCGCGCCA 
GAGCCGGCTTGAGCATGTTGCCGGCGTCCATGGCACCTTCCGCCTTGCCGGCGCCGACCATGGTGTGCAGTTCGTCGATG 
AACAGGATGACCCGGCCTTCCTGCTTGCCCAGTTCGTTGAGGACCGCCTTCAGGCGTTCCTCGAACTCGCCGCGGAACTT 
GGCACCGGCGATCAGCGCCCCCATGTCCAGGGCCAGCAGGCGCTTGTCCTTGAGGCCGTCCGGCACTTCGCCGTTGATGA 
TGCGCTGGGCCAGGCCCTCGACGATGGCGGTCTTGCCGACGCCGGGTTCGCCGATCAGCACCGGGTTGTTCTTGGTCCGC 
CGCTGCAGGACCTGGATGGTCCGGCGGATCTCGTCGTCGCGACCGATCACCGGGTCGAGCTTGCCTTCCTCGGCGCGCTT 
GGTCATGTCGACGGTGTACTTGTCCAGCGCCTGGCGCGACTCCTCGACGTTCGGGTCGTTCACCGCTTCGCCGCCACGCA 
GGTTGGCCACGGCATTCTCCAGCGCCTTGCGCGACACGCCCTGGCCGAGCAGCAGCTTGCCGAGCCTGGTGTTCTCGTCC 
ATCGCGGCCAGCAATACCAGCTCGCTGGAGATGAACTGGTCGCCCTTCTGCTGGGCCAGGCGGTCAGCCTGGTTGAGCAG 
GCGTGCGAGATCCTGGGACAGGTTCACGTCGCCGGTCGGGCTCTGGATCTTCGGCAGCGCGTCGAGTTCTTTGTTGAGGC 
C(KITGCGCAGGGCGGCGATATCGAAGCCGACCTGCATCAGCAGGGGCTTGATCGAACCGCCTTGCTGCTCGAGCAGGGCG 
GAAAGCAGGTGCACCGGCTCGATGGCCGGATGGTCATGGCCAACGGCCAGGGACTGGGCGTCGGAGAGCGCCAGTTGCAG 
CTTGCTGGTCAAACGGTCTATTCGCATGGGTCGTCCTTCCTTCTATAGAGCGGGCCGGAACGATGGGTGTCCCTGA 

>ORF25510 (SEQ ID NO:296) 

TGCGCTGGGCCAGGCCCTCGACGATGGCGGTCTTGCCGACGCCGGGTTCGCCGATCAGCACCGGGTTGTTCTTGGTCCGC 
CGCTGCAGGACCTGGATGGTCCGGCGGATCTCGTCGTCGCGACCGATCACCGGGTCGAGCTTGCCTTCCTCGGCGCGCTT 
GGTCATGTCGACGGTGTACTTGTCCAGCGCCTGGCGCGACTCCTCGACGTTCGGGTCGTTCACCGCTTCGCCGCCACGCA 
GGTTGGCCACGGCATTCTCCAGCGCCTTGCGCGACACGCCCTGGCCGAGCAGCAGCTTGCCGAGCCTGGTGTTCTCGTCC 
ATCGCGGCCAGCAATACCAGCTCGCTGGAGATGA pj g 
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>ORF26762c (SEQ ID NO:298) 

CCGCCGACTGCCT6CC6GC6TT6TTCT6CGACCGCTCGGGCACCCGG6T6GCCGCGGCCCATGCCGGCTGGCGC666CT6 
GCGGCGGGCGTGCTGGAGGCGACGGTGGACAGCCTGGGCGTGCCCGGCGACGAACTGCTGGTCTGGCTGGGGCCGGCGAT 
CGGCCCGCAGGCCTTCGAGGTCGGCGGCGAGGTCCGCGATGCATTCGTCGCTGCGCACGCCGAGGCGCGCTCGGCTTTCG 
TACCTAGCGCCAATCCGGGCCGCTTCATGGCCGAQ^TCTACCGACTCGCGCGGATCCGCCTGGGCGCCCATGGCGTCACC 
GCCGTGCATGGCGGCGGCTTCTGCACCTTCAGCGATACCGCGCGCTTCTATTCCTACCGCCGCTCGTCGCGTACCGGCCG 
TTTTGCCAGCCTGGTCTGGCTCCAGGACTAGGCCCGCGCAGGTTATCCGGCGGCAACTGACCGATGTCACGGTCCGGTCG 
CTTGAACCGCGGAAAATCGCCCTTATCTACTGA 

>ORF26257(SEQIDNO:300) 

ATAAGGGCGATTTTCCGCGGTTCAAGCGACCGGACCGTGACATCGGTCAGTTGCCGCCGGATAACCTGCGCGGGCCTAGT 
CCTGGAGCCAGACCAGGCTGGCAAAACGGCCGGTACGCGACGAGCGGCGGTAGGAATAGAAGCGCGCGGTATCGCTGAAG 
GTGCAGAAGCCGCCGCCATGCACGGCGGTGACGCCATGGGCGCCCAGGCGGATCCGCGCGAGTCGGTAGATGTCGGCCAT 
GAAGCGGCCCGGATTGGCGCTAGGTACGAAAGCCGAGCGCGCCTCGGCGTGCGCAGCGACGAATGCATCGCGGACCTCGC 
CGCC(^CCTCGM(MCCTGCG(MCCGATCGCC(WCCCC^GCCAGACCAG(^GTTCGTCGCCGGGCAC(X:C(»GGCTGTCC 
ACCGTCGCCTCCAGCACGCCCGCCGCCAGCCCGCGCCAGCCGGCATGGGCCGCGGCCACCCGGGTGCCCGAGCGGTCGCA 
GAACAACGCCGGCAGGCAGTCGGCGGTCATGATCGTACAGGCGACGCCCGGCATCGCGCTCCAGCTGGCGTCGGCCCTGA 
GCACCGGTTCGGGTCGGCCTCCACCACGTCACTCCGTGCACCTATTCCAACCAGCTCGGCCGGCATTCCAGACGCTCGGT 
CAGGCGTCGGCGGTTTTATTCCACGGCGCGCGGATCGTCGTAGACGTGGGCGCCAAGGTTCAGACTGTCGAAGGGTGCCT 
GGCTGA 

>ORF26844c (SEQ ID NO:302) 

CGTGGTGGAGGCCGACCCGAACCGGTGCTCAGGGCCGACGCCAGCTGGAGCGCGATGCCGGGCGTCGCCTGTACGATCAT 
GACCGCCGACTGCCTGCCGGCGTTGTTCTGCGACCGCTCGGGCACCCGGGTGGCCGCGGCCCATGCCGGCTGGCGCGGGC 
TGGCGGCGGGCGTGCTGGAGGCGACGGTGGACAGCCTGGGCGTGCCCGGCGACGAACTGCTGGTCTGGCTGGGGCCGGCG 
ATCGGCCCGCAGGCCTTCGAGGTCGGCGGCGAGGTCCGCGATGCATTCGTCGCTGCGCACGCCGAGGCGCGCTCGGCTTT 
CGTACCTAGCGCCAATCCGGGCCGCTTCATGGCCGACATCTACCGACTCGCGCGGATCCGCCTGGGCGCCCATGGCGTCA 
CCGCCGTGCATGGCGGCGGCTTCTGCACCTTCAGCGATACCGCGCGCTTCTATTCCTACCGCCGCTCGTCGCGTACCGGC 
CGTTTTGCCAGCCTGGTCTGGCTCCAGGACTAG 

>ORF26486 (SEQ ID NO:304) 

ATGTCGGCCATGAAGCGGCCCGGATTGGCGCTAGGTACGAAAGCCGAGCGCGCCTCGGCGTGCGCAGCGACGAATGCATC 
GCGGACCTCGCCGCCGACCTCGAAGGCCTGCGGGCCGATCGCCGGCCCCAGCCAGACCAGCAGTTCGTCGCCGGGCACGC 
CCAGGCTGTCCACCGTCGCCTCCAGCACGCCCGCCGCCAGCCCGCGCCAGCCGGCATGGGCCGCGGCCACCCGGGTGCCC 
GAGCGGTCGCAGAACAACGCCGGCAGGCAGTCGGCGGTCATGATCGTACAGGCGACGCCCGGCATCGCGCTCCAGCTGGC 
GTCGGCCCTGAGCACCGGTTCGGGTCGGCCTCCACCACGTCACTCCGTGCACCTATTCCAACCAGCTCGGCCGGCATTCC 
AGACGCTCGGTCAGGCGTCGGCGGTTTTATTCCACGGCGCGCGGATCGTCGTAGACGTGGGCGCCAAGGTTCAGACTGTC 
GAAGGGTGCCTGGCTGACCCCGCCACTGCGCGTGGTCACGCAGGCCCGCACACGGGCCGGCGCCGGCCAGTCGGGGGTCA 
GCCAGGCGTTCAACCGACGAACGCCTCGCGATCCTGGCGCAACAGGCTGAGCAGCCAGAGGAATTCTTCCGGCAGCGGCG 
ATTCCCACTTCATGCGCACGCCGGTGGCCGGGTGA 

>ORF26857c (SEQ ID NO:306) 

GTGCACGGAGTGACGTGGTGGAGGCCGACCCGAACCGGTGCTCAGGGCCGACGCCAGCTGGAGCGCGATGCCGGGCGTCG 
CCTGTACGATCATGACCGCCGACTGCCTGCCGGCGTTGTTCTGCGACCGCTCGGGCACCCGGGTGGCCGCGGCCCATGCC 
GGCTGGCGCGGGCTGGCGGCGGGCGTGCTGGAGGCGACGGTGGACAGCCTGGGCGTGCCCGGCGACGAACTGCTGGTCTG 
GCTGGGGCCGGCGATCGGCCCGCAGGCCTTCGAGGTCGGCGGCGAGGTCCGCGATGCATTCGTCGCTGCGCACGCCGAGG 
CGCGCTCGGCTTTCGTACCTAG 

>ORF27314c (SEQ ID NO:308) 

AGTGGGAATCGCCGCTGCCGGAAGAATTCCTCTGGCTGCTCAGCCTGTTGCGCCAGGATCGCGAGGCGTTCGTCGGTTGA 
ACGCCTGGCTGACCCCCGACTGGCCGGCGCCGGCCCGTGTGCGGGCCTGCGTGACCACGCGCAGTGGCGGGGTCAGCCAG 
GCACCCTTCGACAGTCTGAACCTTGGCGCCCACGTCTACGACGATCCGCGCGCCGTGGAATAAAACCGCCGACGCCTGAC 
CGAGCGTCTGGAATGCCGGCCGAGCTGGTTGGAATAGGTGCACGGAGTGACGTGGTGGAGGCCGACCCGAACCGGTGCTC 
AGGGCCGACGCCAGCTGGAGCGCGATGCCGGGCGTCGCCTGTACGATCATGA 
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>ORF27730c (SEQ ID N0:310) 

CAA6CCC(KC66CCTG6TGGTCCATCCG6CT6CC6GCCATCA6GAC66CACCCTGCT6AATGCCTTGCTCTACCATGTCC 
CGGACATCGCCAATGTGCCGCGCGCCGGGATCGTCCACCGCCTGGACAAGGACACGACCGGCCTGATGGTAGTGGCCAAG 
ACGCTGGAGGCCCACACCAAGCTGGTGGCGCAACTGCAGGCACGGTCGGTCAGCCGCATCTACGAGGCGATCGTGATCGG 
CGTGATCACCTCCGGCGGCACCATCGATGCGCCGATCGGACGGCATGGCGTGCAGCGGCAGAAGATGGCGGTGGTCGACG 
CCGGCAAGGTGGCGGTCAGCCATTACCGCGTGCTGGAACGCTTCCGTGCGCACACCCATACCCGGGTCAAGCTGGAGACC 
GGGCGTACCCACCAGATCCGCGTGCACATGAGCCATATTGGCTATCCCCTGGTCGGCGATCCGGTCTACGGTGGGCGCTT 
CAGGATTCCCCCGGTGGCCAGCCAGACCCTGGTCCAGACTCTTCGCGAATTCCCCCGGCAGGCGCTGCACGCGCGCTTCC 
TCGAACTGGATCACCCGGCCACCGGCGTGCGCATGAAGTGGGAATCGCCGCTGCCGGAAGAATTCCTCTGGCTGCTCAGC 
CTGTTGCGCCAGGATCGCGAGGCGTTCGTCGGTTGAACGCCTGGCTGACCCCCGACTGGCCGGCGCCGGCCCGTGTGCGG 
GCCTGCGTGACCACGCGCAGTGGCGGGGTCAGCCAGGCACCCTTCGACAGTCTGAACCTTGGCGCCCACGTCTACGACGA 
TCCGCGCGCCGTGGAATAA 

>ORF26983 (SEQ ID NO:312) 

CCCCGCCACTGCGCGTGGTCACGCAGGCCCGCACACGGGCCGGCGCCGGCCAGTCGGGGGTCAGCCAGGCGTTCAACCGA 
CGAACGCCTCGCGATCCTGGCGCAACAGGCTGAGCAGCCAGAGGAATTCTTCCGGCAGCGGCGATTCCCACTTCATGCGC 
ACGCCGGTGGCCGGGTGATCCAGTTCGAGGAAGCGCGCGTGCAGCGCCTGCCGGGGGAATTCGCGAAGAGTCTGGACCAG 
GGTCTGGCTGGCCACCGGGGGAATCCTGAAGCGCCCACCGTAGACCGGATCGCCGACCAGGGGATAGCCAATATGGCTCA 
TGTGCACGCGGATCTGGTGGGTACGCCCGGTCTCCAGCTTGACCCGGGTATGGGTGTGCGCACGGAAGCGTTCCAGCACG 
CGGTAATGGCTGACCGCCACCTTGCCGGCGTCGACCACCGCCATCTTCTGCCGCTGCACGCCATGCCGTCCGATCGGCGC 
ATCGATGGTGCCGCCGGAGGTGATCACGCCGATCACGATCGCCTCGTAGATGCGGCTGACCGACCGTGCCTGCAGTTGCG 
CCACCAGCTTGGTGTGGGCCTCCAGCGTCTTGGCCACTACCATCAGGCCGGTCGTGTCCTTGTCCAGGCGGTGGACGATC 
CCGGCGCGCGGCACATTGGCGATGTCCGGGACATGGTAGAGCAAGGCATTCAGCAGGGTGCCGTCCTGATGGCCGGCAGC 
CGGATGGACCACCAGGCCGGCGGGCTTGTCAATCACCAGGATGTGCTCGTCCTCGTAGACGATTTCCAGCTCGATGTCCT 
GTGCGAGCCACTCGCCCTGGGCTTCCTGCTCGGCCTCCAGGACCAGTTGCGCGCCGCTGTGGACGATGTCGCGCGGGCGC 
AGCACGGCGCCGTCGACGGTCAGGCGACCGTCCTTGATCCAGCCGGCCAGACGGGAGCGGGAGTGTTCGGGAAAAAGCTG 
GGCGGCGATCTGGTCGAGACGCTGGCCACCCAGCTCGAACGGCACCTCGGCCGCGCGTTGAATCATATCGGACATGAGTA 
G 

>ORF28068c (SEQ ID NO:314) 

CCACAGCGCGTAGCCGATTCCAAAAGCCGCGCTGAGCATCGTCTCCTACTCATGTCCGATATGATTCAACGCGCGGCCGA 
GGTGCCGTTCGAGCTGGGTGGCCAGCGTCTCGACCAGATCGCCGCCCAGCTITTTCCCGAACACTCCCGCTCCCGTCTGG 
CCGGCTGGATCAAGGACGGTCGCCTGACCGTCGACGGCGCCGTGCTGCGCCCGCGCGACATCGTCCACAGCGGCGCGCAA 
CTGGTCCTGGAGGCCGAGCAGGAAGCCCAGGGCGAGTGGCTCGCACAGGACATCGAGCTGGAAATCGTCTACGAGGACGA 
GCACATCCTGGTGATTGACAAGCCCGCCGGCCTGGTGGTCCATCCGGCTGCCGGCCATCAGGACGGCACCCTGCTGAATG 
CCTTGCTCTACCATGTCCCGGACATCGCCAATGTGCCGCGCGCCGGGATCGTCCACCGCCTGGACAAGGACACGACCGGC 
CTGATGGTAGTGGCCAAGACGCTGGAGGCCCACACCAAGCTGGTGGCGCAACTGCAGGCACGGTCGGTCAGCCGCATCTA 
CGAGGCGATCGTGATCGGCGTGATCACCTCCGGCGGCACCATCGATGCGCCGATCGGACGGCATGGCGTGCAGCGGCAGA 
AGATGGCGGTGGTCGACGCCGGCAAGGTGGCGGTCAGCCATTACCGCGTGCTGGAACGCTTCCGTGCGCACACCCATACC 
CGGGTCAAGCTGGAGACCGGGCGTACCCACCAGATCCGCGTGCACATGAGCCATATTGGCTATCCCCTGGTCGGCGATCC 
GGTCTACGGTGGGCGCTTCAGGATTCCCCCGGTGGCCAGCCAGACCCTGGTCCAGACTCTTCGCGAATTCCCCCGGCAGG 
CGCTGCACGCGCGCTTCCTCGAACTGGATCACCCGGCCACCGGCGTGCGCATGAAGTGGGAATCGCCGCTGCCGGAAGAA 
TTCCTCTGGCTGCTCAGCCTGTTGCGCCAGGATCGCGAGGCGTTCGTCGGTTGA 

>ORF27522 (SEQ ID NO:316) 

CCGACCGTGCCTGCAGTTGCGCCACCAGCTTGGTGTGGGCCTCCAGCGTCTTGGCCACTACCATCAGGCCGGTCGTGTCC 
TTGTCCAGGCGGTGGACGATCCCGGCGCGCGGCACATTGGCGATGTCCGGGACATGGTAGAGCAAGGCATTCAGCAGGGT 
GCCGTCCTGATGGCCGGCAGCCGGATGGACCACCAGGCCGGCGGGCTTGTCAATCACCAGGATGTGCTCGTCCTCGTAGA 
CGATTTCCAGCTCGATGTCCTGTGCGAGCCACTCGCCCTGGGCTTCCTGCTCGGCCTCCAGGACCAGTTGCGCGCCGCTG 
TGGACGATGTCGCGCGGGCGCAGCACGGCGCCGTCGACGGTCAGGCGACCGTCCTTGA 
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>ORF28033c (SEQ ID NO:318) 

GCATC6TCTCCTACTCATGTCCGATAT6ATTCAACGCGCG6CCGAGGTGCCGTTCGAGCT66GTGGCCAGC6TCTCGACC 
AGATCGCCGCCCAGCTTTTTCCCGAACACTCCCGCTCCCGTCTGGCCGGCTGGATCAAGGACGGTCGCCTGACCGTCGAC 
GGCGCCGTGCTGCGCCCGCGCGACATCGTCCACAGCGGCGCGCAACTGGTCCTGGAGGCCGAGCAGGAAGCCCAGGGCGA 
GTGGCTCGCACAGGACATCGAGCTGGAAATCGTCTACGAGGACGAGCACATCCTGGTGATTGA 

>ORF2970k (SEQ ID NO:320) 

TCTTCCAGTTCGCTGGAGATCAGCAGGACCAGTACCAGGCCGATGGTCAGGCGGTACAGGTGGTACAGACGGAGGATGCG 
TTGCCCCTGCTCCTCGCTCAGCCGTAGCCGTTCAGCGCGCACGGTCGCCCTGGTCCTGGCGCAGGTGCGCCTGGCTGCAA 
TACCAGCGTTGTTCGTGGGCGAGGGCGTTGGCCTGCGGCACGTGGACGCCGCAATGGGCGCAGCGGACCATCGGCGATGC 
GCTCGGCTCGTCCTGCGGACGTTGCTGCTGGCGCGGAGTGGGACGGGTAAAGCGACGCCAGAGCCAGAACGCGATGGCGA 
TCAGGGCGATCQGMCAGGAGGCGGAAAAGGCCCATGGTGATCTCGGAGGCTGGAGAAAGCTGCAGTTTAGCCAAGCCG 
CCGGCTCGATCCCAGACGGGAAGGTCCAGGCTGTGCGGCGTTTGGCGCTGGGAGAGGCATGGCGGCGGGCAAAAAGAAGG 
GAGGCCTGCGCCTCCCTTCGGTGTTTCGTGCGATCAGTCGAAGAGACCGAAGGTCATGTAGCTCCACCAGGAGCGACCGG 
AGTCCTCGTCGTCATCGCTCTCCGGCTTCTCGTCGTCGGCGCTGTGATCCTGGTTTTCCGGCTTCAGTTCGGCGGGGATC 
TCCCGCTCGGCATCCTCGTACTGCTTGATCACGTCCTTGGCGGCCTGGGTTTCCATGTGCGGCGGCGGCTCGCCGCCTTC 
GATCAGGCCCAGGGTGGCCTTGGCCAGCCAGGAGCGGGTGTCGGCCTCGCTTTCGCGGGCGACGAACTCGCCATCCTTGA 
GGCTGGCGTTATCCGGATAGTTCAGCTTGAGGGTTTCCAGGCTGGTGCTGGCCAGGTCGTCGAGACCCAGGCGACGGTAG 
GCTTCGACCATGATCGCCAGGCCATCGCCGACGGCCGGGGTTTCCTGGAAGTTCTCCACCACGTAGCGACCGCGGTTGGC 
GGCGGCGACATAGGCCTGGCGCTTCAGGTAGTAGTGGCCGACGTGCACTTCGTAGGCCGCCAGCAGGTTGCGCAGGTACA 
CCATGCGCGCCTTGGCGTCCGGGGCGTAGCGGCTGTTGGGGAAGCGGCTGGTGAGCTGGGCGAACTCGTTGAAGGAGTCG 
CGGGCGGCGCCCGGGTCGCGCTTGGTCATGTCCAGCGGCAGGAAGCGCGCCAGCAGGCCGCGGTCCTGGTCGAAGGAGGA 
CAGGCCTTTGAGGTAGTAGGCGTAGTCGACGTTGGGGTGCTGCGGATGCAGGCGGATGAAGCGTTCGGCGGCGGCGCGGG 
CGGCTTCGGGCTCCATGTTCTTGTAGTTGGCGTAGATCAGCTCGAGCTGGGCCTGCTCGGCGTAGCGGCCGAAGGGATAG 
CGCGATTCGAGGGCTTTCAGCTTGGTGACGGCGCTGTTGTAGCTCTTGTTGTTGAGGTCGTCCTGCGCCTGCTGGTACAG 
CTGGCTCTCGCTCAGGTTCTCGTCGACAGTCTCCTTGTTCGAGGAGCAGGCTGCGGTGAGGGCGAGGATGGCGATCAGCA 
GCAGGTGTTTCACTTGCATGGCGGCTTGCGTCCCTGGGACGGTCGGCTTGGCCTCAACCGTCTGTTATGA 

>ORF28118(SEQID NO:322) 

CAGACGGTTGAGGCCAAGCCGACCGTCCCAGGGACGCAAGCCGCCATGCAAGTGAAACACCTGCTGCTGATCGCCATCCT 
CGCCCTCACCGCAGCCTGCTCCTCGAACAAGGAGACTGTCGACGAGAACCTGAGCGAGAGCCAGCTGTACCAGCAGGCGC 
AGGACGACCTCAACAACAAGAGCTACAACAGCGCCGTCACCAAGCTGAAAGCCCTCGAATCGCGCTATCCCTTCGGCCGC 
TACGCCGAGCAGGCCCAGCTCGAGCTGATCTACGCCAACTACAAGAACATGGAGCCCGAAGCCGCCCGCGCCGCCGCCGA 
ACGCTTCATCCGCCTGCATCCGCAGCACCCCAACGTCGACTACGCCTACTACCTCAAAGGCCTGTCCTCCTTCGACCAGG 
ACCGCKCCTGCTGGCGCGCTTCCTGCCGCTGGACAT^CCAAGCGCGACCCGGGCKCKCCGCGACTCCTTCAACGAG 
TTCGCCCAGCTCACCAGCCGCTTCCCCAACAGCCGCTACGCCCCGGACGCCAAGGCGCGCATGGTGTACCTGCGCAACCT 
GCTGGCGGCCTACGAAGTGCACGTCGGCCACTACTACCTGAAGCGCCAGGCCTATGTCGCCGCCGCCAACCGCGGTCGCT 
ACGTGGTGGAGAACTTCCAGGAAACCCCGGCCGTCGGCGATGGCCTGGCGATCATGGTCGAAGCCTACCGTCGCCTGGGT 
CTCGACGACCTGGCCAGCACCAGCCTGGAAACCCTCAAGCTGAACTATCCGGATAACGCCAGCCTCAAGGATGGCGAGTT 
CGTCGCCCGCGAAAGCGAGGCCGACACCCGCTCCTGGCTGGCCAAGGCCACCCTGGGCCTGATCGAAGGCGGCGAGCCGC 
CGCCGCACATGGAAACCCAGGCCGCCAAGGACGTGATCAAGCAGTACGAGGATGCCGAGCGGGAGATCCCCGCCGAACTG 
AAGCCGGAAAACCAGGATCACAGCGCCGACGACGAGAAGCCGGAGAGCGATGACGACGAGGACTCCGGTCGCTCCTGGTG 
GAGCTACATGACCTTCGGTCTCTTCGACTGA 

>ORF28129(SEQIDNO:324) 

GGCCAAGCCGACCGTCCCAGGGACGCAAGCCGCCATGCAAGTGAAACACCTGCTGCTGATCGCCATCCTCGCCCTCACCG 
CAGCCTGCTCCTCGAACAAGGAGACTGTCGACGAGAACCTGAGCGAGAGCCAGCTGTACCAGCAGGCGCAGGACGACCTC 
AACAACAAGAGCTACAACAGCGCCGTCACCAAGCTGAAAGCCCTCGAATCGCGCTATCCCTTCGGCCGCTACGCCGAGCA 
GGCCCAGCTCGAGCTGATCTACGCCAACTACAAGAACATGGAGCCCGAAGCCGCCCGCGCCGCCGCCGAACGCTTCATCC 
GCCTGCATCCGCAGCACCCCAACGTCGACTACGCCTACTACCTCAAAGGCCTGTCCTCCTTCGACCAGGACCGCGGCCTG 
CTGGCGCGCTTCCTGCCGCTGGACATGACCAAGCGCGACCCGGGCGCCGCCCGCGACTCCTTCAACGAGTTCGCCCAGCT 
CACCAGCCGCTTCCCCAACAGCCGCTACGCCCCGGACGCCAAGGCGCGCATGGTGTACCTGCGCAACCTGCTGGCGGCCT 
ACGAAGTGCACGTCGGCCACTACTACCTGAAGCGCCAGGCCTATGTCGCCGCCGCCAACCGCGGTCGCTACGTGGTGGAG 
AACTTCCAGGAAACCCCGGCCGTCGGCGATGGCCTGGCGATCATGGTCGAAGCCTACCGTCGCCTGGGTCTCGACGACCT 
GGCCAGCACCAGCCTGGAAACCCTCAAGCTGAACTATCCGGATAA 
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>ORF29709c (SEQ ID NO:326) 

GGACCT6ATCTTCCAGTTC6CTG6AGATCAGCAGGACCA6TACCAG6CCGAT6GTCA6GC6GTACA6GTGGTACA6AC66 
AGGATGCGTTGCCCCTGCTCCTCGCTCAGCCGTAGCCGTTCAGCGCGCACGGTCGCCCTGGTCCTGGCGCAGGTGCGCCT 
GGCTGCAATACCAGCGTTGTTCGTGGGCGAGGGCGTTGGCCTGCGGCACGTGGACGCCGCAATGGGCGCAGCGGACCATC 
GGCGATGCGCTCGGCTCGTCCTGCGGACGTTGCTGCTGGCGCGGAGTGGGACGGGTAAAGCGACGCCAGAGCCAGAACGC 
GATGGCGATCAGGGCGATCCAGAACAGGAGGCGGAAAAGGCCCATGGTGATCTCGGAGGCTGGAGAAAGCTGCAGTTTAG 
CCAAGCCGCCGGCTCGATCCCAGACGGGAAGGTCCAGGCTGTGCGGCGTTTGGCGCTGGGAGAGGCATGGCGGCGGGCAA 
AAAGAAGGGAGGCCTGCGCCTCCCTTCGGTGTTTCGTGCGATCAGTCGAAGAGACCGAAGGTCATGTAG 

>ORF29189(SEQIDNO:328) 

TCGCACGAAACACCGAAGGGAGGCGCAGGCCTCCCTTCTTTTTGCCCGCCGCCATGCCTCTCCCAGCGCCAAACGCCGCA 
CAGCCTGGACCTTCCCGTCTGGGATCGAGCCGGCGGCTTGGCTAAACTGCAGCTTTCTCCAGCCTCCGAGATCACCATGG 
GCCTTTTCCGCCTCCTGTTCTGGATCGCCCTGATCGCCATCGCGTTCTGGCTCTGGCGTCGCTTTACCCGTCCCACTCCG 
CGCCAGCAGCAACGTCCGCAGGACGAGCCGAGCGCATCGCCGATGGTCCGCTGCGCCCATTGCGGCGTCCACGTGCCGCA 
GGCCAACGCCCTCGCCCACGAACAACGCTGGTATTGCAGCCAGGCGCACCTGCGCCAGGACCAGGGCGACCGTGCGCGCT 
GA 

>ORF29382(SEQIDNO:330) 

TCGCCATCGCGTTCTGGCTCTGGCGTCGCTTTACCCGTCCCACTCCGCGCCAGCAGCAACGTCCGCAGGACGAGCCGAGC 

GCATCGCCGATGGTCCGCTGCGCCCATTGCGGCGTCCACGTGCCGCAGGCCAACGCCCTCGCCCACGAACAACGCTGGTA 

TTGCAGCCAGGCGCACCTGCGCCAGGACCAGGGCGACCGTGCGCGCTGAACGGCTACGGCTGAGCGAGGAGCAGGGGCAA 

CGCATCCTCCGTCTGTACCACCTGTACCGCCTGACCATCGGCCTGGTACTGGTCCTGCTGATCTCCAGCGAACTGGAAGA 

TCAGGTCCTCAAGCTCGTCCACCCTGAACTGTTCCATGTCGGCAGTTGGTGCTACCTGGTCTTCAACATCCTGGTCGCGC 

TGTTCCTGCCGCCGTCGCGGCAATTGCTGCCGATCTTCATCCTCGCGCTCACCGACGTGCTGATGCTTTGCGGCCTGTTC 

TACGCAGGTGGCGGCGTACCCAGCGGCATCGGCAGCCTGCTGGTGGTGGCGGTGGCCATTGCCAACATCCTGCTGCGCGG 

GCGCATCGGCCTGGTCATCGCGGCGGCGGCCAGCCTCGGCCTGCTCTACCTGACCTTCTTCCTCAGCCTGAGCAGTCCGG 

ACGCCACCAACCACTACGTCCAGGCCGGCGGCCTCGGCACCCTGTGCTTCGCCGCCGCGCTGGTGATCCAGGCTCTGGTG 

CGGCGCCAGGAGCAGACCGAAACGCTGGCCGAAGAACGCGCCGAGACGGTCGCCAACCTGGAGGAACTCAACGCATTGAT 

CCTGCAGCGCATGCGCACCGGCATCCTCGTGGTCGATAGCCGTCAGGCCATCCTCCTCGCCAACCAGGCCGCCCTCGGCC 

TGCT(^G(XAGGACGACGTGCAGGGCGCCA(K:CTCGGCCGCCACAGCCC<3ATGCTGATGCACTGC^TGAAG 

CTGAATCCCAGCCTCCGTCCGCCGACGCTCAAGGTGGTGCCGGATGGCCCGACGGTGCAACCCAGCTTTATCAGCCTCAA 

CCGCGAAGACGACCAGCACGTGCTGATCTTCCTCGAAGACATTTCGCAGATCGCCCAGCAGGCGCAGCAGATGAAGCTGG 

CCGGTCTTGGCCGCCTGACCGCCGGCATCGCCCATGAGATCCGCAACCCGCTGGGCGCGATCAGCCACGCCGCCCAACTG 

CTGCAGGAGTCAGAGGAACTGGATGCCCCGGACCGACGCCTGACGCAGATCATCCAGGACCAGTCGAAGCGGATGAACCT 

GGTCATCGAGAACGTCCTGCAGCTCTCCCGTCGCCGCCAGGCCGAACCGCAGCAGCTCGACCTGAAGGAGTGGCTTCAGC 

GGTTCGTCGACGAATACCCCGGCAGGCTGCGCAACGACAGCCAACTGCACCTGCAGCTCGGTGCCGGCGACATCCAGACC 

CGCATGGACCCACACCAGTTGAACCAGGTGCTGAGCAACCTGGTGCAGAACGGTCTTCGCTACAGCGCCCAGGCGCACGG 

GCGCGGCCAGGTCTGGCTGAGCCTCGCGCGCGACCCGGAGAGCGACCTGCCGGTGCTGGAAGTCATCGACGACGGTCCCG 

GCGTACCGGCGGACAAACTGAACAACCTGTTCGAACCCTTCTTTACTACAGAAAGCAAAGGCACCGGCCTGGGCCTCTAT 

CTCTCCCGCGAACTCTGCGAGAGCAACCAGGCACGGATCGACTACCGCAATCGCGAGGAAGGCGGCGGCTGCTTCCGCAT 

CACCTTCGCCCACCCGCGCAAACTCAGCTGA 
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>ORF30590c (SEQ ID NO:332) 

CTCCT6CAGCAGTTG6GCG6CGTGGCTGATCGC6CCCAGCG6GTT6CG6ATCTCATGGGCGAT6CC66CGGTCAG6C6GC 
CAAGACCGGCCAGCTTCATCTGCTGCGCCTGCTGGGCGATCTGCGAAATGTCTTCGAGGAAGATCAGCACGTGCTGGTCG 
TCTTCGCGGTTGAGGCTGATAAAGCTGGGTTGCACCGTCGGGCCATCCGGCACCACCTTGAGCGTCGGCGGACGGAGGCT 
GGGATTCA(WC(KCATTGCTTCATGCAGTGQT(^GCATCGGGCTGTGGCG(KC(^G(K:TGGCGCCCTGCACGTCGTCCT 
GCCTGAGCAGGCCGAGGGCGGCCTGGTTGGCGAGGAGGATGGCCTGACGGCTATCGACCACGAGGATGCCGGTGCGCATG 
CGCTGCAGGATCAATGCGTTGAGTTCCTCCAGGTTGGCGACCGTCTCGGCGCGTTCTTCGGCCAGCGTTTCGGTCTGCTC 
CTGGCGCCGCACCAGAGCCTGGATCACCAGCGCGGCGGCGAAGCACAGGGTGCCGAGGCCGCCGGCCTGGACGTAGTGGT 
TGGTGGCGTCCGGACTGCTCAGGCTGAGGAAGAAGGTCAGGTAGAGCAGGCCGAGGCTGGCCGCCGCCGCGATGACCAGG 
CCGATGCGCCCGCGCAGCAGGATGTTGGCAATGGCCACCGCCACCACCAGCAGGCTGCCGATGCCGCTGGGTACGCCGCC 
ACCTGCGTAGAACAGGCCGCAAAGCATCAGCACGTCGGTGAGCGCGAGGATGAAGATCGGCAGCAATTGCCGCGACGGCG 
GCAGGAACAGCGCGACCAGGATGTTGAAGACCAGGTAGCACCAACTGCCGACATGGAACAGTTCAGGGTGGACGAGCTTG 
AGGACCTGATCTTCCAGTTCGCTGGAGATCAGCAGGACCAGTACCAGGCCGATGGTCAGGCGGTACAGGTGGTACAGACG 
GAGGATGCGTTGCCCCTGCTCCTCGCTCAGCCGTAG 

>ORF29729 (SEQ ID NO:334) 

ACTGTTCCATGTCGGCAGTTGGTGCTACCTGGTCTTCAACATCCTGGTCGCGCTGTTCCTGCCGCCGTCGCGGCAATTGC 
TGCCGATCTTCATCCTCGCGCTCACCGACGTGCTGATGCTTTGCGGCCTGTTCTACGCAGGTGGCGGCGTACCCAGCGGC 
ATCGGCAGCCTGCTGGTGGTGGCGGTGGCCATTGCCAACATCCTGCTGCGCGGGCGCATCGGCCTGGTCATCGCGGCGGC 
GGCCAGCCTCGGCCTGCTCTACCTGACCTTCTTCCTCAGCCTGAGCAGTCCGGACGCCACCAACCACTACGTCCAGGCCG 
GCGGCCTCGGCACCCTGTGCTTCGCCGCCGCGCTGGTGATCCAGGCTCTGGTGCGGCGCCAGGAGCAGACCGAAACGCTG 
GCCGAAGAACGCGCCGAGACGGTCGCCAACCTGGAGGAACTCAACGCATTGATCCTGCAGCGCATGCGCACCGGCATCCT 
CGTGGTCGATAG 

>ORF30221 (SEQ ID NO:336) 

CCGTCAGGCCATCCTCCTCGCCAACCAGGCCGCCCTCGGCCTGCTCAGGCAGGACGACGTGCAGGGCGCCAGCCTCGGCC 
GCCACAGCCCGATGCTGATGCACTGCATGAAGCAATGGCGCCTGAATCCCAGCCTCCGTCCGCCGACGCTCAAGGTGGTG 
CCGGATGGCCCGACGGTGCAACCCAGCTTTATCAGCCTCAACCGCGAAGACGACCAGCACGTGCTGATCTTCCTCGAAGA 
CATTTCGCAGATCGCCCAGCAGGCGCAGCAGATGAAGCTGGCCGGTCTTGGCCGCCTGACCGCCGGCATCGCCCATGA 

>ORF30736c (SEQ ID NO:338) 

AGCCACTCCTTCAGGTCGAGCTGCTGCGGTTCGGCCTGGCGGCGACGGGAGAGCTGCAGGACGTTCTCGATGACCAGGTT 
CATCCGCTTCGACTGGTCCTGGATGATCTGCGTCAGGCGTCGGTCCGGGGCATCCAGTTCCTCTGACTCCTGCAGCAGTT 
GGGCGGCGTGGCTGATCGCGCCCAGCGGGTTGCGGATCTCATGGGCGATGCCGGCGGTCAGGCGGCCAAGACCGGCCAGC 
TTQTCTGCTGCGCCTGCTGGGCGATCTGCGAAATGTCTTCGAGGAAGATCAGCACGTGCTGGTCGTCTTCGCGGTTGAG 
GCTGATAAAGCTGGGTTGCACCGTCGGGCCATCCGGCACCACCTTGAGCGTCGGCGGACGGAGGCTGGGATTCAGGCGCC 
ATTGCTTCATGCAGTGCATCAGCATCGGGCTGTGGCGGCCGAGGCTGGCGCCCTGCACGTCGTCCTGCCTGAGCAGGCCG 
AGGGCGGCCTGGTTGGCGAGGAGGATGGCCTGA 

>ORF30539 (SEQ ID NO:340) 

GATCCGCAACCCGCTGGGCGCGATCAGCCACGCCGCCCAACTGCTGCAGGAGTCAGAGGAACTGGATGCCCCGGACCGAC 
GCCTGACGCAGATCATCCAGGACCAGTCGAAGCGGATGAACCTGGTCATCGAGAACGTCCTGCAGCTCTCCCGTCGCCGC 
CAGGCCGAACCGCAGCAGCTCGACCTGAAGGAGTGGCTTCAGCGGTTCGTCGACGAATACCCCGGCAGGCTGCGCAACGA 
CAGCCAACTGCACCTGCAGCTCGGTGCCGGCGACATCCAGACCCGCATGGACCCACACCAGTTGAACCAGGTGCTGAGCA 
ACCTGGTGCAGAACGGTCTTCGCTACAGCGCCCAGGCGCACGGGCGCGGCCAGGTCTGGCTGAGCCTCGCGCGCGACCCG 
GAGAGCGACCTGCCGGTGCTGGAAGTCATCGACGACGGTCCCGGCGTACCGGCGGACAAACTGAACAACCTGTTCGAACC 
CTTCTTTACTACAGAAAGCAAAGGCACCGGCCTGGGCCTCTATCTCTCCCGCGAACTCTGCGAGAGCAACCAGGCACGGA 
TCGACTACCGCAATCGCGAGGAAGGCGGCGGCTGCTTCCGCATCACCTTCGCCCACCCGCGCAAACTCAGCTGACGGAAG 
CCGCACGCATGAGCCGACAAAAAGCCCTGATCGTCGACGATGAACCGGATATCCGCGAACTGCTGGAAATCACTCTCGGC 
CGCATGAAGCTGGACACCCGCAGCGCCCGCAACGTCAAGGAAGCCGCGAGTTGCTGGCCCGCGAGCCGTTCGACCTGTGC 
CTCACCGACATGCGCCTGCCGGACGGCAGCGGCCTCGA1CTGGTCCAGTACATCCAGCAGCGCCATCCACAGACCCCGGT 
GGCCATGA 
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>ORF31247c (SEQ ID NO:342) 

TTTCCAGCA6TTCGCG6ATATCC6GTTCATCGTC6ACGATCAGGGCTTTTTGTCGGCTCAT6CGTGCGGCTTCCGTCAGC 
TGAGTTTGCGCGGGTGGGCGMGGTGATGCGGAAGCAGCCGCCGCCTTCCTCGCGATTGCGGTAGTCGATCCGTGCCTGG 
TTGCTCTCGCAGAGTTCGCGGGAGAGATAGAGGCCCAGGCCGGTGCCTTTGCTTTCTGTAGTAAAGAAGGGTTCGAACAG 
GTTGTTCAGTTTGTCCGCCGGTACGCCGGGACCGTCGTCGATGACTTCCAGCACCGGCAGGTCGCTCTCCGGGTCGCGCG 
CGAGGCTCAGCCAGACCTGGCCGCGCCCGTGCGCCTGGGCGCTGTAGCGAAGACCGTTCTGCACCAGGTTGCTCAGCACC 
TGGTTCAACTGGTGTGGGTCCATGCGGGTCTGGATGTCGCCGGCACCGAGCTGCAGGTGCAGTTGGCTGTCGTTGCGCAG 
CCTGCCGGGGTATTCGTCGACGAACCGCTGAAGCCACTCCTTCAGGTCGAGCTGCTGCGGTTCGGCCTGGCGGCGACGGG 
AGAGCTGCAGGACGTTCTCGATGACCAGGTTCATCCGCTTCGACTGGTCCTGGATGATCTGCGTCAGGCGTCGGTCCGGG 
GCATCCAGTTCCTCTGA 

>ORF30963c (SEQ ID NO:344) 

CTTCCAGCACCGGCAGGTCGCTCTCCGGGTCGCGCGCGAGGCTCAGCCAGACCTGGCCGCGCCCGTGCGCCTGGGCGCTG 
TAGCGAAGACCGTTCTGCACCAGGTTGCTCAGCACCTGGTTCAACTGGTGTGGGTCCATGCGGGTCTGGATGTCGCCGGC 
ACCGAGCTGCAGGTGCAGTTGGCTGTCGTTGCGCAGCCTGCCGGGGTATTCGTCGACGAACCGCTGAAGCCACTCCTTCA 
GGTCGAGCTGCTGCGGTTCGGCCTGGCGGCGACGGGAGAGCTGCAGGACGTTCTCGATGA 

>ORF31539c (SEQ ID NO:346) 

ggcggttgccaccagctcccgcaagcgaccgaggtcgaccggtttggtgaggaagtcgaaggcaccggccttgagcgcct 
ggatcgcggtgtccaggctgccgtacgcggtgatcatggccaccggggtctgtggatggcgctgctggatgtactggacc 
agatcgaggccgctgccgtccggcaggcgcatgtcggtgaggcacaggtcgaacggctcgcgggccagcaactcgcggct 
tccttgacgttgcgggcgctgcgggtgtccagcttcatgcggccgagagtgatttccagcagttcgcggatatccggttc 
atcgtcgacgatcagggctttttgtcggctcatgcgtgcggcttccgtcagctga 

>orf31222(seqidno:348) 
accg&tatccgcgaactgctggaaatcactctw^ 

cc(x:gagttgct(mcccgcga(^cgttcgacctgtgcctcaccgacat(k:gcctgccggacggcagcggcctcgatctgg 
tccagtacatccagcagcgccatccacagaccccggtggccatgatcaccgcgtacggcagcctggacaccgcgatccag 
gcgctcaaggccggtgccttcgacttcctcaccaaaccggtcgacctcggtcgcttgcgggagctggtggcaaccgccct 
acgcttgcgcaacccggaagccgaggaagcgccggtggacaaccgcctgctcggcgagtcgccgccgatgcgcgccctgc 
gcaaccagatcggcaagctggcgcgcagccaggcgccggtctacatcagtggcgagtccggcagcggcaaggaactggtg 
gcgcgcctgatccacgagcaggggccacgtatcgagcggccgttcgtgccggtgaactgcggcgcgattccctccgagct 
gatggaaagcgagttcttcggccacaagaaaggcagcttcactggcgctatcgaagacaagcagggcctgttccaggccg 
ccagcggtgg<^ccctgttcctcgacgaagtc(k:cgacct(k:cgatggccatgcaggtcaaactgctccgggcgatccag 
gaaaaggccgtgcgcgcggtcggcggccagcaggaggtcgccgtcgcacgtgcgcatcctctgcgccacccacaaggacc 
tcgccgccgaagtcggcgccgggcgcttccgccaggacctctactaccgcctcaacgtcatcgagctgcgcgtacaccgc 
tgcgcgaacgccgcgaggacatcccgctgctcgccgaacgcatcctcaagcgcctggccggcgacaccggcctgccggcc 
gccaggctgaccggcgacgcacaggagaagctgaagaactaccgcttcccgggcaacgtccgcgagctggaaaacatgct 
ggagcgcgcctataccctgtgcgaagacgaccagatccagcctcacgacctgcgcctggccgatgcgccgggtgccagcc 
aggaaggcgccgcgagcctgagcgaaatcgacaacctcgaggactacctggaagacatcgagcgcaagctgatcatgcag 
gcactcgaggagacccgctggaaccgcaccgccgcggcccagcgcctgggcctgacgttccgctcgatgcgctaccgcct 
gaaaaagctgggcatcgactga 
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>ORF31266 (SEQ ID NO:350) 

AGCT66ACACCCGCAGCGCCC6CAAC6TCAAGGAAGCCGCGAGTTGCT6GCCCGCGAGCCGTTC6ACCTGTGCCTCACCG 
ACATGCGCCTGCCGGACGGCAGCGGCCTCGATCTGGTCCAGTACATCCAGCAGCGCCATCCACAGACCCCGGTGGCCATG 
ATCACCKGTACGGCAGCCTGGAQCCGCGATCCAGGCGCT^GGCCGGTGCCTTC^CTTCCTCACCAAACCGGTCGA 
CCTCGGTCGCTTGCGGGAGCTGGTGGCMCCGCCCTACGCTTGCGCAACCCGGAAGCCGAGGAAGCGCCGGTGGACAACC 
GCCTGCTCGGCGAGTCGCCGCCGATGCGCGCCCTGCGCAACCAGATCGGCAAGCTGGCGCGCAGCCAGGCGCCGGTCTAC 
ATCAGTGGCGAGTCCGGCAGCGGCAAGGAACTGGTGGCGCGCCTGATCCACGAGCAGGGGCCACGTATCGAGCGGCCGTT 
CGTGCCGGTGMCTGCGGCGCGATTCCCTCCGAGCTGATGGAAAGCGAGTTCTTCGGCCAC^GAAAGGCAGCTTCACTG 
GCGCTATCGAAGACAAGCAGGGCCTGTTCCAGGCCGCCAGCGGTGGCACCCTGTTCCTCGACGAAGTCGCCGACCTGCCG 
ATGGCCATGCAGGTCAAACTGCTCCGGGCGATCCAGGAAAAGGCCGTGCGCGCGGTCGGCGGCCAGCAGGAGGTCGCCGT 
CGCACGTGCGCATCCTCTGCGCCACCCACAAGGACCTCGCCGCCGAAGTCGGCGCCGGGCGCTTCCGCCAGGACCTCTAC 
TACCGCCTCAACGTCATCGAGCTGCGCGTACACCGCTGCGCGAACGCCGCGAGGACATCCCGCTGCTCGCCGAACGCATC 
CTCAAGCGCCTGGCCGGCGACACCGGCCTGCCGGCCGCCAGGCTGACCGGCGACGCACAGGAGAAGCTGAAGAACTACCG 
CTTCCCGGGCAACGTCCGCGAGCTGGAAAACATGCTGGAGCGCGCCTATACCCTGTGCGAAGACGACCAGATCCAGCCTC 
ACGACCTGCGCCTGGCCGATGCGCCGGGTGCCAGCCAGGAAGGCGCCGCGAGCCTGAGCGAAATCGACAACCTCGAGGAC 
TACCTGGAAGACATCGAGCGCAAGCTGATCATGCAGGCACTCGAGGAGACCCGCTGGAACCGCACCGCCGCGGCCCAGCG 
CCTGGGCCTGACGTTCCGCTCGATGCGCTACCGCCTGAAAAAGCTGGGCATCGACTGAAAGTGAAAAGGCCTGTCCGAAG 
ACAGGCCTTTTGGTTTTCGCTCCTCAGAGGCGACCAGCCGGGGCGTAGGGGGCCGGGTCGATGA 

>ORF31661c(SEQID NO:352) 

ACCGGCGCCTGGCTGCGCGCCAGCTTGCCGATCTGGTTGCGCAGGGCGCGCATCGGCGGCGACTCGCCGAGCAGGCGGTT 
GTCCACCGGCGCTTCCTCGGCTTCCGGGTTGCGCAAGCGTAGGGCGGTTGCCACCAGCTCCCGCAAGCGACCGAGGTCGA 
CCGGTTTGGTGAGGAAGTCGAAGGCACCGGCCTTGAGCGCCTGGATCGCGGTGTCCAGGCTGCCGTACGCGGTGATCATG 
GCCACCGGGGTCTGTGGATGGCGCTGCTGGATGTACTGGACCAGATCGAGGCCGCTGCCGTCCGGCAGGCGCATGTCGGT 
GAGGCACAGGTCGAACGGCTCGCGGGCCAGCAACTCGCGGCTTCCTTGA 

>ORF32061c (SEQ ID NO:354) 

AGGTCCTGGCGGAAGCGCCCGGCGCCGACTTCGGCGGCGAGGTCCTTGTGGGTGGCGCAGAGGATGCGCACGTGCGACGG 
CGACCTCCTGCTGGCCGCCGACCGCGCGCACGGCCTTTTCCTGGATCGCCCGGAGCAGTTTGACCTGCATGGCCATCGGC 
AGGTCGGCGACTTCGTCGAGGAACAGGGTGCCACCGCTGGCGGCCTGGAACAGGCCCTGCTTGTCTTCGATAGCGCCAGT 
GAAGCTGCCTTTCTTGTGGCCGAAGAACTCGCTTTCCATCAGCTCGGAGGGAATCGCGCCGCAGTTCACCGGCACGAACG 
GCCGCTCGATACGTGGCCCCTGCTCGTGGATCAGGCGCGCCACCAGTTCCTTGCCGCTGCCGGACTCGCCACTGATGTAG 
ACCGGCGCCTGGCTGCGCGCCAGCTTGCCGATCTGGTTGCGCAGGGCGCGCATCGGCGGCGACTCGCCGAGCAGGCGGTT 
GTCCACCGGCGCTTCCTCGGCTTCCGGGTTGCGCAAGCGTAG 

>ORF32072c (SEQ ID NO:356) 

GGCGGTAGTAGAGGTCCTGGCGGAAGCGCCCGGCGCCGACTTCGGCGGCGAGGTCCTTGTGGGTGGCGCAGAGGATGCGC 
ACGTGCGACGGCGACCTCCTGCTGGCCGCCGACCGCGCGCACGGCCTTTTCCTGGATCGCCCGGAGCAGTTTGACCTGCA 
TGGCCATCGGCAGGTCGGCGACTTCGTCGAGGAACAGGGTGCCACCGCTGGCGGCCTGGAACAGGCCCTGCTTGTCTTCG 
ATAGCGCCAGTGAAGCTGCCTTTCTTGTGGCCGAAGAACTCGCTTTCCATCAGCTCGGAGGGAATCGCGCCGCAGTTCAC 
CGGCACGAACGGCCGCTCGATACGTGGCCCCTGCTCGTGGATCAGGCGCGCCACCAGTTCCTTGCCGCTGCCGGACTCGC 
CACTGATGTAG 

>ORF31784 (SEQ ID NO:358) 

TGGAAAGCGAGTTCTTCGGCCACAAGAAAGGCAGCTTCACTGGCGCTATCGAAGACAAGCAGGGCCTGTTCCAGGCCGCC 
AGCGGTGGCACCCTGTTCCTCGACGAAGTCGCCGACCTGCCGATGGCCATGCAGGTCAAACTGCTCCGGGCGATCCAGGA 
AAAGGCCGTGCGCGCGGTCGGCGGCCAGCAGGAGGTCGCCGTCGCACGTGCGCATCCTCTGCGCCACCCACAAGGACCTC 
GCCGCCGAAGTCGGCGCCGGGCGCTTCCGCCAGGACCTCTACTACCGCCTCAACGTCATCGAGCTGCGCGTACACCGCTG 
CGCGAACGCCGCGAGGACATCCCGCTGCTCGCCGAACGCATCCTCAAGCGCCTGGCCGGCGACACCGGCCTGCCGGCCGC 
CAGGCTGA 
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>ORF32568c (SEQ ID NO:360) 

G6AGC6AAAACCAAAA66CCT6TCTTC66ACA6GCCTTTTCACTTTCAGTC6ATGCCCAGCTTTTTCA6GC66TAGC6CA 
TCGAGCGGAACGTCAGGCCCAGGCGCTGGGCCGCGGCGGTGCGGTTCCAGCGGGTCTCCTCGAGTGCCTGCATGATCAGC 
TTGCGCTCGATGTCTTCCAGGTAGTCCTCGAGGTTGTCGATTTCGCTCAGGCTCGCGGCGCCTTCCTGGCTGGCACCCGG 
CGCATCGGCCAGGCGCAGGTCGTGAGGCTGGATCTGGTCGTCTTCGCACAGGGTATAGGCGCGCTCCAGCATGTTTTCCA 
GCTCGCGGACGTTGCCCGGGAAGCGGTAGTTCTTCAGCTTCTCCTGTGCGTCGCCGGTCAGCCTGGCGGCCGGCAGGCCG 
GTGTCGCCGGCCAGGCGCTTGAGGATGCGTTCGGCGAGCAGCGGGATGTCCTCGCGGCGTTCGCGCAGCGGTGTACGCGC 
AGCTCGATGACGTTGAGGCGGTAG 

>ORF33157c (SEQ ID NO:362) 

ACAGACGGAGGTGCGCGGCTGGTTGCGCGACGGCGATCGAGTGGTCGGCGTGGCGACCTCGCGTGGCGAGATCCGTGGCG 
ACAAGGTGCTGCTGGCGGCAGGCGCCTGGAGCGGCGAGTTGTTGAAGCCGCTTGGCCTGGAACTGCCCGTGGTACCGGTG 
AAAGGTCAGATGATCCTCTACAAGTGCGCGGCGGATTTCCTGCCGCGCATGGTGCTGGCCAAGGGGCGCTACGCGATTCC 
GCGGCGCGACGGCCACATCCTGATCGGCAGCACCTTGGAACATTCGGGCTTCGACAAGACGCCGACCGACGAGGCGCTGG 
AAAGCCTCAGGGCGTCTGCGGCAGAACTGTTGCCGGAACTGGCGGACATGCAGCCGGTGGCCCACTGGGCAGGGTTGCGC 
CCGGGCTCTCCCGAAGGCATCCCCTATATCGGTCCGGTGCCTGGCTTCGACGGGCTCTGGCTGAATACCGGGCACTACCG 
CAACGGGCTGGTCCTGGCACCGGCGTCGTGCCGTCTGCTGGCGGATCTCATGAGCGGGCGGGAACCGATCATCGACCCGG 
CCCCCTACGCCCCGGCTGGTCGCCTCTGAGGAGCGAAAACCAAAAGGCCTGTCTTCGGACAGGCCTTTTCACTTTCAGTC 
GATGCCCAGCTTTTTCAGGCGGTAGCGCATCGAGCGGAACGTCAGGCCCAGGCGCTGGGCCGCGGCGGTGCGGTTCCAGC 
GGGTCTCCTCGAGTGCCTGCATGA 

>ORF32530 (SEQ ID NO:364) 

AAAGGCCTGTCCGAAGACAGGCCTTTTGGTTTTCGCTCCTCAGAGGCGACCAGCCGGGGCGTAGGGGGCCGGGTCGATGA 
TCGGTTCCCGCCCGCTCATGAGATCCGCCAGCAGACGGCACGACGCCGGTGCCAGGACCAGCCCGTTGCGGTAGTGCCCG 
GTATTCAGCCAGAGCCCGTCGAAGCCAGGCACCGGACCGATATAGGGGATGCCTTCGGGAGAGCCCGGGCGCAACCCTGC 
CCAGTGGGCCACCGGCTGCATGTCCGCCAGTTCCGGCAACAGTTCTGCCGCAGACGCCCTGAGGCTTTCCAGCGCCTCGT 
CGGTCGGCGTCTTGTCGAAGCCCGAATGTTCCAAGGTGCTGCCGATCAGGATGTGGCCGTCGCGCCGCGGAATCGCGTAG 
CGCCCCTTGGCCAGCACCATGCGCGGCAGGAAATCCGCCGCGCACTTGTAGAGGATCATCTGACCTTTCACCGGTACCAC 
GGGCAGTTCCAGGCCAAGCGGCTTCAACAACTCGCCGCTCCAGGCGCCTGCCGCCAGCAGCACCTTGTCGCCACGGATCT 
CGCCACGCGAGGTCGCCACGCCGACCACTCGATCGCCGTCGCGCAACCAGCCGCGCACCTCCGTCTGTTCATGCAACTCG 
AGATTGGCGAATTGTTGCAGGGATGCCCGCAATGA 

>ORF33705c (SEQ ID NO:366) 

GTGATATTTCTCTGTTCCTGGCAAATCGGTAGGAGCCCTGTGGTGAGTAGAGATGTAGTAGTGGTAGGCGCTGGCGTCAT 
CGGCCTGTTGACCGCCCGGGAGCTGGCGCTCGCCGGACTGCGGGTGACCCTGGTGGAGCGGGGCGAGAGTGGGCGTGAGG 
CATCCTGGGCGGGAGGCGGGATCGTCTCGCCGCTCTATCCGTGGCGCTACAGCCCGGCGGTGACCGCCCTGGCGCACTGG 
TCGCAGGACTTCTACCCGGCCCTGGGGCAGCGTTTGCTCGACGAGACCGGGCTCGATCCCGAGGTCCATACCGTTGGCCT 
GTACTGGCTGGACCTGGACGACCAGACCGAGGGICTGCAGTGGGCACGCAACCACACCCGGCCGTTGAAGGAAGTGCCGA 
TCGAGGAGGCCTACGCGGCGGTGCCCGGGCTGGGCGCAGGCTTCCAGCGGGCGGTCTACATGTCGGGCGTGGCCAATGTG 
CGCAATCCTCGCCTGGCGCGCTCATTGCGGGCATCCCTGCAACAATTCGCCAATCTCGAGTTGCATGAACAGACGGAGGT 
GCGCGGCTGGTTGCGCGACGGCGATCGAGTGGTCGGCGTGGCGACCTCGCGTGGCGAGATCCGTGGCGACAAGGTGCTGC 
TGGCGGCAGGCGCCTGGAGCGGCGAGTTGTTGAAGCCGCTTGGCCTGGAACTGCCCGTGGTACCGGTGAAAGGTCAGATG 
ATCCTCTACAAGTGCGCGGCGGATTTCCTGCCGCGCATGGTGCTGGCCAAGGGGCGCTACGCGATTCCGCGGCGCGACGG 
CCACATCCTGATCGGCAGCACCTTGGAACATTCGGGCTTCGACAAGACGCCGACCGACGAGGCGCTGGAAAGCCTCAGGG 
CGTCTGCGGCAGAACTGTTGCCGGAACTGGCGGACATGCAGCCGGTGGCCCACTGGGCAGGGTTGCGCCCGGGCTCTCCC 
GAAGGCATCCCCTATATCGGTCCGGTGCCTGGCTTCGACGGGCTCTGGCTGAATACCGGGCACTACCGCAACGGGCTGGT 
CCTGGCACCGGCGTCGTGCCGTCTGCTGGCGGATCTCATGAGCGGGCGGGAACCGATCATCGACCCGGCCCCCTACGCCC 
CGGCTGGTCGCCTCTGA 
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>ORF32832 (SEQ ID NO:368) 

6GCTTTCCAGCGCCTC6TCGGTCGGCGTCTTGTCGAAGCCCGAATGTTCCAA6GT6CTGCC6ATCA6GAT6T6GCCGTCG 
CGCCGCGGAATCGCGTAGCGCCCCTTGGCCAGCACCATGCGCGGCAGGAAATCCGCCGCGCACTTGTAGAGGATCATCTG 
ACCTTTCACCGGTACCACGGGCAGTTCCAGGCCAAGCGGCTTCAACAACTCGCCGCTCCAGGCGCCTGCCGCCAGCAGCA 
CCTTGTC(X:CAC(3^TCTCGCCACGCGAGGTCGCCACGCCGACCACTCGATCGCCGTCGCGCAACCAGCCGCGCACCTCC 
GTCTGTTCATGCAACTCGAGATTGGCGAATTGTTGCAGGGATGCCCGCAATGAGCGCGCCAGGCGAGGATTGCGCACATT 
GGCCACGCCCGACATGTAGACCGCCCGCTGGAAGCCTGCGCCCAGCCCGGGCACCGCCGCGTAGGCCTCCTCGATCGGCA 
CTTCCTTCAACGGCCGGGTGTGGTTGCGTGCCCACTGCAGTGCCTCGGTCTGGTCGTCCAGGTCCAGCCAGTACAGGCCA 
ACGGTATGGACCTCGGGATCGAGCCCGGTCTCGTCGAGCAAACGCTGCCCCAGGGCCGGGTAGAAGTCCTGCGACCAGTG 
CGCCAGGGCGGTCACCGCCGGGCTGTAGCGCCACGGATAGAGCGGCGAGACGATCCCGCCTCCCGCCCAGGATGCCTCAC 
GCCCACTCTCGCCCCGCTCCACCAGGGTCACCCGCAGTCCGGCGAGCGCCAGCTCCCGGGCGGTCAACAGGCCGATGA 

>ORF33547c (SEQ ID NO:370) 

GGCATCCTGGGCGGGAGGCGGGATCGTCTCGCCGCTCTATCCGTGGCGCTACAGCCCGGCGGTGACCGCCCTGGCGCACT 
GGTCGCAGGACTTCTACCCGGCCCTGGGGCAGCGTTTGCTCGACGAGACCGGGCTCGATCCCGAGGTCCATACCGTTGGC 
CTGTACTGGCTGGACCTGGACGACCAGACCGAGGCACTGCAGTGGGCACGCAACCACACCCGGCCGTTGAAGGAAGTGCC 
GATCGAGGAGGCCTACGCGGCGGTGCCCGGGCTGGGCGCAGGCTTCCAGCGGGCGGTCTACATGTCGGGCGTGGCCAATG 
TGCGCAATCCTCGCCTGGCGCGCTCATTGCGGGCATCCCTGCAACAATTCGCCAATCTCGAGTTGCATGA 

>ORF33205 (SEQ ID NO:372) 

GCGCGCCAGGCGAGGATTGCGCACATTGGCCACGCCCGACATGTAGACCGCCCGCTGGAAGCCTGCGCCCAGCCCGGGCA 
CCGCCGCGTAGGCCTCCTCGATCGGCACTTCCTTCAACGGCCGGGTGTGGTTGCGTGCCCACTGCAGTGCCTCGGTCTGG 
TCGTCCAGGTCCAGCCAGTACAGGCCAACGGTATGGACCTCGGGATCGAGCCCGGTCTCGTCGAGCAAACGCTGCCCCAG 
GGCCGGGTAGAAGTCCTGCGACCAGTGCGCCAGGGCGGTCACCGCCGGGCTGTAGCGCCACGGATAGAGCGGCGAGACGA 
TCCCGCCTCCCGCCCAGGATGCCTCACGCCCACTCTCGCCCCGCTCCACCAGGGTCACCCGCAGTCCGGCGAGCGCCAGC 
TCCCGGGCGGTCAACAGGCCGATGACGCCAGCGCCTACCACTACTACATCTCTACTCACCACAGGGCTCCTACCGATTTG 
CCAGGAACAGAGAAATATCACTCAAAGGGATCAGATGCTGACGAATTGCCTGCTTCAACGAACTCAGTCGAATCTAGTCC 
CGGTGAAAAGCCCATCATACCCGCAGAGGTATTCATCCCATGA 

>ORF33512 (SEQ ID NO.-374) 

AGCGGCGAGACGATCCCGCCTCCCGCCCAGGATGCCTCACGCCCACTCTCGCCCCGCTCCACCAGGGTCACCCGCAGTCC 
GGCGAGCGCCAGCTCCCGGGCGGTCAACAGGCCGATGACGCCAGCGCCTACCACTACTACATCTCTACTCACCACAGGGC 
TCCTACCGATTTGCCAGGAACAGAGAAATATCACTCAAAGGGATCAGATGCTGACGAATTGCCTGCTTCAACGAACTCAG 
TCGAATCTAGTCCCGGTGAAAAGCCCATCATACCCGCAGAGGTATTCATCCCATGAAATCGAGTGGTTTGAATTTGGTGG 
AACTATCGATAGTCCTATCGATCCTTGCGATAGGCGTGACAATTGCGCTGCCCACCCTCCCCGACAGAATGAAGCGGGAC 
ATTAG 

>ORF33771(SEQID NO:376) 

AAAGCCCATCATACCCGCAGAGGTATTCATCCCATGAAATCGAGTGGTTTGAATTTGGTGGAACTATCGATAGTCCTATC 
GATCCTTGCGATAGGCGTGACAATTGCGCTGCCCACCCTCCCCGACAGAATGAAGCGGGACATTAGCCGTGATATTGGTG 
ACAGCCTGACTAGTCATGTGATGGCTGCGCGGGCTAGCAGCATACAGAACGGCGTGATCATCGAGGTGTGCGGTAGCGGT 
GACGGCAGTACCTGCAGCGAGGAATGGCATCTCGGCTGGTTCAGCCGTAACGACAGGAGCCAACAGATACTGGCCCGGCA 
TGAAAATACGAGTCGCACCGATATTCATTGGCGGGGCTTCGACMGCGACTGCGCTACCTGCCrAATGGCACCAGCCCTA 
CAGGTAACGGGCGTTTCTTCGAATGTAAGGACGATCGCATCGAGTGGCAATTGGTGCTCAATCGGCAAGGCCGCCTCAGG 
GTGGCGGGAAAGAGCGAAAATAAAAAGCTCTCTTACCTGTGCTCCAGGCGGTGA 

>ORF34385c (SEQ ID NO:378) 

TGGAGAGCGCATTGTCCCTGTAGCAGAGACAGCCGGAGCGGAGAGTGGGATGACTGGCAAACGGTATGTGAAACAGTTCT 
CTCACCGCCTGGAGCACAGGTAAGAGAGCTTTTTATTTTCGCTCTTTCCCGCCACCCTGAGGCGGCCTTGCCGATTGAGC 
ACCAATTGCCACTCGATGCGATCGTCCTTACATTCGAAGAAACGCCCGTTACCTGTAGGGCTGGTGCCATTAGGCAGGTA 
GCGCAGTCGCTTGTCGAAGCCCCGCCAATGAATATCGGTGCGACTCGTATTTTCATGCCGGGCCAGTATCTGTTGGCTCC 
TGTCGTTACGGCTGAACCAGCCGAGATGCCATTCCTCGCTGCAGGTACTGCCGTCACCGCTACCGCACACCTCGATGATC 
ACGCCGTTCTGTATGCTGCTAGCCCGCGCAGCCATCACATGACTAGTCAGGCTGTCACCAATATCACGGCTAATGTCCCG 
CTTCATTCTGTCGGGGAGGGTGGGCAGCGCAATTGTCACGCCTATCGCAAGGATCGATAG 
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>ORF33988 (SEQ ID NO:380) 

TCATC6AGGTGTGCGGTAGCG6TGACG6CAGTACCTGCA6CGA6GAATGGCATCTCGGCTGGTTCAGCCGTAACGACA6G 
AGCCAACAGATACTGGCCCGGCATGAAAATACGAGTCGCACCGATATTCATTGGCGGGGCTTCGACAAGCGACTGCGCTA 
CCTGCCTAATGGCACCAGCCCTACAGGTAACGGGCGTTTCTTCGAATGTAAGGACGATCGCATCGAGTGGCAATTGGTGC 
TCAATCGGCAAGGCCGCCTCAGGGTGGCGGGAAAGAGCGAAAATAAAAAGCTCTCTTACCTGTGCTCCAGGCGGTGAGAG 
AACTGTTTCACATACCGTTTGCCAGTCATCCCACTCTCCGCTCCGGCTGTCTCTGCTACAGGGACAATGCGCTCTCCACT 
AG 

>ORF34274 (SEQ ID NO:382) 

AAAGCTCTCTTACCTGTGCTCCAGGCGGTGAGAGAACTGTTTCACATACCGTTTGCCAGTCATCCCACTCTCCGCTCCGG 
CTGTCTCTGCTACAGGGACAATGCGCTCTCCACTAGGCAAGATTATCTGGCCCTTTTCCTTGTGGAGTACTGCATGCGCT 
CTATTTGTCGCAGCGCCGGCTTTTCCCTGATCGAGTTGATGATGGTGTTGGTTCTGGTCGCCATATTCGCCAGCATTGCC 
GTACCCAGTTTCAACGCCTTGATCGAGCGCAACCGAATCCAGACTGCCAGCGAGGAACTCTACAGCCTGCTTCAGTACGC 
TCGCAGCGAAGCTGTAAACCGTCATGCCAATGTGAGCATCAGGGCGACGCAGAA(^ATGACTGGGCAAAAGGCCTGGAAA 
TCATCAGCGGCGCGACCACCGTGCAAAAGCACCAAGGTTTCCAGCAGGTCTCGCTATCCGCCAGCAGTGCGACTGCGGAG 
CTGACCTTCAACGCTACCGGCACACTTAGCAACCAGGCTGCAAACATTGACATAAAGGTCTGCTTCGCCGGTGAQAAAG 
TACAGGACGTCTGCTTACCGTTCAGCCCAGTGGACGCGTGATCCTGTACCCATCTTCAAAGCAACCGGACAGCTGTAACT 
GA 

>ORF34726c(SEQID NO:384) 

CGAGACCTGCTGGAAACCTTGGTGCTTTTGCACGGTGGTCGCGCCGCTGATGATTTCCAGGCCTTTTGCCCAGTCATTGT 
TCTGCGTCGCCCTGATGCTCACATTGGCATGACGGTTTACAGCTTCGCTGCGAGCGTACTGAAGCAGGCTGTAGAGTTCC 
TCGCTGGCAGTCTGGATTCGGTTGCGCTCGATCAAGGCGTTGAAACTGGGTACGGCAATGCTGGCGAATATGGCGACCAG 
AACCAACACCATCATCMCTCGATCAGGGAAAAGCCGGCGCTGCGACAAATAGAGCGCATGCAGTACTCCACAAGGAAAA 
GGGCCAGATAATCTTGCCTAGTGGAGAGCGCATTGTCCCTGTAGCAGAGACAGCCGGAGCGGAGAGTGGGATGACTGGCA 
AACGGTATGTGAAACAGTTCTCTCACCGCCTGGAGCACAGGTAA 

>ORF34916 (SEQ ID NO:386) 

GGAAAGCCCATGTCTCGAGAAACGGGTTTCAGCATGATCGAAGTACTGGTTGCTCTGGTGCTGATCAGCATTGGCGTACT 
GGGCATGGTTGCCATGCAAGGGCGCACGATCCAGTACACGCAGGAGTCGGTACAACGCAATGCCGCAGCAATGCTTGCTA 
GCGACCTGATGGAAATAATGCGTGCGGACCCAGATGCCGTACTCAATCTACGCGCCCAACTACGCGAAGACTCGGTCTAC 
TACAAGGCCAAGGGCAGCGACTTTCCCGCAGCCCCAGCGCGCTGCGCGCCATTGCCAGCAGATGCTAAGGAACGTCTCGG 
CTGCTGGGCCCAACAGGCCTCGAAAGACTTGCCGGGAGCCTCCGCACTCTTGAATAGCCAATTCTACATTTGTCGCAGCC 
CAACCCCGGGTACCTGCGACAACACCAAAGGCTCGGCCATCGAAATCCAGGTTGCCTGGCGAGCCATGGATGGAGCGTGT 
TTCAACGCCTCTGACTCCACCTTGTGCACCTACAGCGTCCGCTCCGAATTGTGA 

>ORF35464c (SEQ ID NO:388) 

AGAG(»TGCTTGTTCTCACAAnCGGAGCGGACGCTGTAGGTGCACAAGGTGGAGTCAGAGGCGTTGAAACACGCTCCAT 
CCATGGCTCGCCAGGCAACCTGGATTTCGATGGCCGAGCCTTTGGTGTTGTCGCAGGTACCCGGGGTTGGGCTGCGACAA 
ATGTAGAATTGGCTATTCAAGAGTGCGGAGGCTCCCGGCAAGTCTTTCGAGGCCTGTTGGGCCCAGCAGCCGAGACGTTC 
CTTAGCATCTGCTGGCAATGGCGCGCAGCGCGCTGGGGCTGCGGGAAAGTCGCTGCCCTTGGCCTTGTAGTAGACCGAGT 
CTTCGCGTAG 

>ORF35289 (SEQ ID NO:390) 

ATAGCCAATTCTACATTTGTCGCAGCCCAACCCCGGGTACCTGCGACAACACCAAAGGCTCGGCCATCGAAATCCAGGTT 
GCCTGGCGAGCCATGGATGGAGCGTGTTTCAACGCCTCTGACTCCACCTTGTGCACCTACAGCGTCCGCTCCGAATTGTG 
AGAACAAGCATGCTCTTCAGCAAAATGCAGAAAGGCCTATCGATGGTAGAACTGCTCGTGGCACTCGCTATAAGCAGCTT 
CCTGATCCTGGGGATCAGCCAGATCTACATCGACAACAAACGCAACTATCTTTTCCAGCAAGGCCAGGCCGGCAACCAGG 
AAAATAG 
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>ORF35410 {SEQ ID NO:392) 

CTCCACCTTGT6CACCTACAGCGTCC6CTCC6AATTGTGAGAACAASCATGCTCTTCAGCAAAAT6CAGAAA6GCCTATC 
GATGGTA(^CTGCTCGTGGCACTCGCTATAAGCA(K:TTCCTGATCCTG(MGATCAGCCAGATCTACATCGACAACAAAC 
GCAACTATCTTTTCCAGCMGGCCAGGCC^ 

GATAAGACAGCCTATCGTCGCCTTCACGACGACAACATGGAGAATGCTTTCAAATCCGCGACATTCAATGGCTGTCGTGC 
ATTTGTGGCTGGCGAGACTATCGCTGCGGCAACTGCCCTCAAGGCGGGTGAGTACGGTGTCTGCTTGCGCTATCAACCCG 
CCTACAAAGGGGAGCATGATTGCCTCGGTAATGAAATTACCGGAGTTCCGGAAAAGCCCTTCACAAATACTCCCCCTGTC 
GTCGTTCGCCTGGTCTACCTACCGAGCGCCGGTACCCTGAGTTGCAGTCGTCCCGATATCGCCCAGTCGAAATCGGGAGA 
ATTGGTCAGTGGTCTCACAGACTTCCGCTTGGAAGCGGGGGTCGGGCCAGCAGATCGTAGCGAACGCAAAGTATCCAGCT 
TCGTCGCACTACAGGATGTCGCCGGTCGTCCTATCCGAGCATTGCGCTTCTCAATCCTGGCAGGCAGCGACAATACAAGC 
CTGCGCACAGGAGATGATAGCCAGGCACGCGATCGCTGGATCGTCCTTTATCCCGAGAGCAAAAGCGCCATCGAGGCCGC 
AGACAAAGGCCAGATTTACCAAATAGCGCGTGGTAACCAAACCATCAGGAATCTCATGCCATGA 

>ORF35907c (SEQ ID NO:394) 

GTAGACCAGGCGAACGACGACAGGGGGAGTATTTGTGAAGGGCTTTTCCGGAACTCCGGTAATTTCATTACCGAGGCAAT 
CATGCTCCCCTTTGTAGGCGGGTTGATAGCGCAAGCAGACACCGTACTCACCCGCCTTGAGGGCAGTTGCCGCAGCGATA 
GTCTCGCCAGCCACAAATGCACGACAGCCATTGAATGTCGCGGATTTGAAAGCATTCTCCATGTTGTCGTCGTGAAGGCG 
ACGATAGGCTGTCTTATCCAGTTGTTGCTGCAGCAGCATAAGAACGAAGCGGCTATTTTCCTGGTTGCCGGCCTGGCCTT 
GCTGGAAAAGATAGTTGCGTTTGTTGTCGATGTAGATCTGGCTGATCCCCAGGATCAGGAAGCTGCTTATAGCGAGTGCC 
ACGAGCAGTTCTACCATCGATAG 

>ORF35534 (SEQ ID NO:396) 

TCCTGGGGATCAGCCAGATCTACATCGACAACAAACGCAACTATCTTTTCCAGCAAGGCCAGGCCGGCAACCAGGAAAAT 
AGCCGCTTCGTTCTTATGCTGCTGCAGCAACAACTGGATAAGACAGCCTATCGTCGCCTTCACGACGACAACATGGAGAA 
TGCTTTCAAATCCGCGACATTCAATGGCTGTCGTGCATTTGTGGCTGGCGAGACTATCGCTGCGGCAACTGCCCTCAAGG 
CGGGTGAGTACGGTGTCTGCTTGCGCTATCAACCCGCCTACAAAGGGGAGCATGATTGCCTCGGTAATGAAATTACCGGA 
GTTCCGGAAAAGCCCTTCACAAATACTCCCCCTGTCGTCGTTCGCCTGGTCTACCTACCGAGCGCCGGTACCCTGA 

>ORF35930 (SEQ ID NO:398) 

GTTGCAGTCGTCCCGATATCGCCCAGTCGAAATCGGGAGAATTGGTCAGTGGTCTCACAGACTTCCGCTTGGAAGCGGGG 
GTCGGGCCAGCAGATCGTAGCGAACGCAAAGTATCCAGCTTCGTCGCACTACAGGATGTCGCCGGTCGTCCTATCCGAGC 
ATTGCGCTTCTCAATCCTGGCAGGCAGCGACAATACAAGCCTGCGCACAGGAGATGATAGCCAGGCACGCGATCGCTGGA 
TCGTCCTTTATCCCGAGAGCAAAAGCGCCATCGAGGCCGCAGACAAAGGCCAGATTTACCAAATAG 

>ORF36246 (SEQ ID NO:400) 

CCAAACCATCAGGAATCTCATGCCATGACCCTGCGCCATACCTCTCGACAGCAGGGATCCACGTTGTTGATCTCGCTGGT 
TATCTTGTTGATGATCACGCTCCTCGCCGTTTCCAACATGCGCGAGGTGTCACTGGAAAGCCGTATCACCGGCAATCTCA 
TCGAACAGAAGCGCCTGCGCAATGCGGGCGAAGCTGGGCTACGCGAAGGTGAACGACGCTTTTTCAATACCATCAAGCCC 
CCAGAGGTCGGCAGCGGATGCGCCGATAGCAATGTCAAACGGCCTTGCATACTGAACCTGAGTGCCCTCTCCGTACCCCG 
AGATGACGTGCACAACAATCCGGTGGCAGCCCTGAACGGCAAGACAGATAACGCCAATTCACGTGTCTGGATGCCCTACC 
GAGGCAGCGATCTGAATAACCCTACGCAGATCGACAAAGACCGCGCAGTCACCTGGCAGACCATCACGGTGCCCGCTGGC 
GAACAGAACAACGAAGCGGAAAATCCCGAGTACGGCAACATGATGCGCGGGGTCGGCACGTTCTACTACGAAACCAACAG 
CCGCGCCCTCAACAAGGCGGGCGGAGAGACTGTTCTACAGGCCGTTCATGCACGCCTGTATACCAACTGA 

>ORF26640c (SEQ ID NO:402) 

GGCATCCAGACACGTGAATTGGCGTTATCTGTCTTGCCGTTCAGGGCTGCCACCGGATTGTTGTGCACGTCATCTCGGGG 
TACGGAGAGGGCACTCAGGTTCAGTATGCAAGGCCGTTTGACATTGCTATCGGCGCATCCGCTGCCGACCTCTGGGGGCT 
TGATGGTATTGAAAAAGCGTCGTTCACCTTCGCGTAGCCCAGCTTCGCCCGCATTGCGCAGGCGCTTCTGTTCGATGAGA 
TTGCCGGTGATACGGCTTTCCAGTGACACCTCGCGCATGTTGGAAACGGCGAGGAGCGTGATCATCAACAAGATAACCAG 
CGAGATCAACAACGTGGATCCCTGCTGTCGAGAGGTATGGCGCAGGGTCATGGCATGA 
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>ORF36769 (SEQ ID NO:404) 

T6C6C66(KTC6(^C6TTCTACTAC{^CCAACAGCCGCGCCCTCAACAAG6CG66CG6A6AGACT6TTCTACA6GCC 
GTTCATGCACGCCTGTATACCAACTGACTGGAGCCAGCGCATGATCCACCAGATTACCCGCGCAGGAAAAAGCCTGCTGG 
CTGCAGGTTGCACCCTGAGCATCCTGTTCGCCTCTGACAGTTATGCCGCCACGGCCCTGAATGTCAGCCAGCAACCCCTG 
TTCCTMCCCAGGGCGTTGCTCCCAACCTGCTGTTCACTCTAGATGACTCAGGCAGTATGGCCTGGGCTTACGTGCCCGA 
CGGTATTAGCGGGAATAGCGGCAGAGCGGGACGTTCCAGCGATTACAACGCACTGTACTACAACCCCGATTATGCTTACC 
AAGTGCCCAAGAAATTGACACTGTCAGGCGATCAGATCATCGTTTCCGACTATCCAGTGCCACGCTTCACAGCAGCCTGG 
CAGGATGGCTACGCCCAAGGCTCCACCACCAACCTGAGCAATAACTATCGCCCTCAATGGGGAACCGGCTGGCTTGGTTG 
CATCGATAGCAGCTGCAATACCGGGAGAGCTTATTACTATACTTATAAGGTAAGCGCTAGCTGCCCTGCACAGCCGGTGA 
GCAGCTCCAACTCCTGTTATACCTACAATGCTCTTC™ 

CGCAACCGCATCCTGGCCACAAAGACCGCTGCCAACCTGGCCTTTTACAGCCTGCCGGAAAACGTGCGTCTCACTTGGGG 
GGCCCTGAACACCTGTAGCATCGGCGCCAACAGCAGAAGCTGCCAAAACAATGCCCTGCTCCAATTCAACAAGCAGCACA 
AAATCAATTTCTTCAATTGGCTGGCGAACAGCCCGGCCAGCGGCGGTACTCCTCTGCATGCGGCTCTTGACCGAGCCGGA 
CGCTTCTTGCAAACCAACGGCACAGCTTATACCACCGAAGACGGAAAGACATATTCCTGCCGGGCCAGCTATCACATCAT 
GATGACCGACGGTATCTGGAACGGTCGGAACGTCACCCCCGGCAATCTCGACAACCAGAACCAGACCTTTCCTGATAGCA 
CCCTCTATAGGCCACAGCCCCCTTATGCCGACAGCAATGCCAGCTCATTGGCTGACCTGGCTTTCAAATACTGGACCACA 
GACTTACGTCCCAGCATCGACAATGACCTGAAGCCTTTCATGGCCTACAAGAGTGGGGACGATTCCAAGGATTACTGGGA 
CCCTCGCAACAACCCAGCCACTTGGCAACACATGGTCAACTTTACCGTTGGCCTAGGTCTTTCCTATTCGCTCACATTGA 
ACTCTGCACCMCTTGGACAGGCAGCACCTTTGGCAACTACGAGGAGTTGATGGCTGGAAGCAAGGCTTGGCCCAGCGTC 
GATAACGACGCCGCACCCGGTAACGTCTACGACCTCTGGCATGCAGCTATCAACTCTCGTGGAGACTTCTTTAGCGCGGA 
ATCACCGGACTCTCTGGTTCAGGCTTTCAATAAGATCCTGACACGGATTTCCGAGCGCAACACCTCCTCCTCCAAACCAG 
CAATGACTTCCGCGCTGCAGGATGACGGAACCGGCGACAAGCTGATCCGCTACAGCTACCAGTCCAGCTTTGCCAGTGAC 
AAGAACTGGGCGGGCGACCTTATACGTTACAAGGTGGAGTCGACTTCCACCGGTTCGACCAAAACCCAGGAATGGAGCGC 
CGGCGCACTGCTGGACAACCGAGCTCCCGCTACCCGTAATATTTACATCGCCAGCAATAGCGGAACCAACCGCCTTAAGC 
CTTTCACATGGAGCAATATTGAGGGAAGTCAGTTAGCCACTTGGCTGAACCGCAACCCGGACAAGGACAATCAGGCCGAC 
ACCAAAGGAGCACAGCGGGTCGACTTCATCCGTGGCCAGCAGAATATGGATGGATTCCGGCAACGACAGGCGGTGTTAGG 
GGACATCGTGCACTCGTCTCCAGCCGTGGTCGGACCGGCCCAATACCTCACTTATCTGGCCAACCCCATCGAACCCAGCG 
GCGACTACGGCACATTCAAGACAGAGGCAGACCAGCGCAGCCCTAGAGTTTATGTTGGATCCAACGATGGCATGTTGCAT 
GGTTTCAACATCAAMCCGGCGTGGAAGAGTTCGCTTTCATCCCTACAGCAGTAnCGAAAAGCTTAACAAGCTTACCGG 
CATCAGCTACCAGGGCGGTGCCCACCAATATTTCGTCGACGCTACACCGGTCGTCAGCGATGCCTTTTTCGATGGAGCTT 
GKACACTGTTCTISATCGGMCGCTTtWTGCTGGAGGTCGCGKCTGTTCGCACTCGATGTMCCMGCCGGACGATGTC 
AAGCTGCTTTGGGAATACGATAGCAGTACCGACTCGGACCTTGGTTACACCTTCTCCAAACCTACCGTAGCCAGACTGCA 
CAGCGIGACAATGGGCAGTAGTTACCGGCAACGGCTATGGAAGCGATAATGACAAGGCAGCTTTACTGCTGATTGATTTGA 
AAMGGGAACGCTGATCAAGAAGCTGGAAGTCCAAAGCGAGCGCGGAATAGCCAATGGCCTATCGACGCCTCGCCTGGCT 
GATAACAACAGCGATGGCATTGCTGACTACGCCTATGCTGGCGATCTGCAGGGAAATATCTGGCGCTTCGATTTGATCGG 
CAATACCCGCAACGACGACCCAGACACAAATACCTCTATCAATCCCTTCAAGCCCGGAGATGTAGATCCTTCTGCTTTCA 
GAGTATCGTTCAGCGGCGCCCCGCTTTTCCGTGCTCGCGCCGACAACAATACTCGTCAGCCCATCACGGCTCCGCCTACC 
TTGGTACGCCATCCTAGCCGTAAGGGCTACATCGTCATCGTAGGTACAGGAAAATACTTCGAGGACGATGACGCTCAGGC 
CGATACCAGCCGAGCCATGACGCTCTATGGTATCTGGGATCGCCAGACCAAGGGCGAAAGCGCAAACAGTACCCCAACCA 
TCGACCGCAACGCCCTCACAGCCCAAACCATGACAACAGAGGCGAACTCCACATTCGGTAGCGTGAACAGGAATATTCGG 
CTTATTAGCCAAAACCCGGTGAAGTGGTACAAAGACGGAGCAACCGGTACCGCGAACTCGGATGTGGCTAGCTATGGCTG 
GCGACTGAATCTGGAGGTCAATAGCAGCAAGAAAGGCGAAATGATGATCGAAGATATGTTCGCTGCCGGCCAAGTGCTTC 
TATTGCAGACCTTGACACCGAACGACGACCCTTGTGACAGCGGCTCTACCAGCTGGACCTACGGCCTCAATCCATATACT 
GGCGGACGTACCAGTTTCACCGTCTTCGATCTCAAACGTGCGGGTATAGTGGACTCTGGCTCGGATTACAACGGCTCGGT 
CGTATCCGCCTTCCAACAGGATGGACTAGGTGGCTTGGCCATTACCCAGAACGAACAGCGTCAATCCGAGGCTTGCACTG 
GTGATGAGTGCATCATCTTCAACCCCAGCGACAAGAGTAACGGACGACAAACCTGGCGGGTCGTCGAGGAGAAATGA 
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>ORF37932c (SEQID NO:406) 

GCTGGCATTGCTGTCGGCATAAGGGGGCTGTGGCCTATAGAGGGTGCTATCAGGAAAGGTCTGGTTCTGGTTGTCGA6AT 

TGCCGGGGGTGACGTTCCGACCGTTCCAGATACCGTCGGTCATCATGATGTGATAGCTGGCCCGGCAGGAATATGTCTTT 

CCGTCTTCGGTGGTATAAGCTGTGCCGTTGGTTTGCAAGAAGCGTCCGGCTCGGTCAAGAGCCGCATGCAGAGGAGTACC 

GCCGCTGGCCGGGCTGTTCGCCAGCCAATTGAAGAAATTGATTTTGTGCTGCTTGTTGAATTGGAGCAGGGCATTGTTTT 

GGCAGCTTCTGCTGTTGGCGCCGATGCTACAGGTGTTCAGGGCCCCCCAAGTGAGACGCACGTTTTCCGGCAGGCTGTAA 

AA(MCQGGTTGGCAGCGGTCTTTGT(MCCAGGAT(M:(KTTGC(3ATAGTAG(5AGTACCATATCGCAAAGTTG^ 

ACTGGTAGGAAGAGCATTGTAGGTATAACAGGAGTTGGAGCTGCTCACCGGCTGTGCAGGGCAGCTAGCGCTTACCTTAT 

AAGTATAGTAATAAGCTCTCCCGGTATTGCAGCTGCTATCGATGCAACCAAGCCAGCCGGTTCCCCATTGAGGGCGATAG 

TTATTGCTCAGGTTGGTGGTGGAGCCTTGGGCGTAGCCATCCTGCCAGGCTGCTGTGAAGCGTGGCACTGGATAGTCGGA 

AACGATGATCTGATCGCCTGA 

>ORF38640c (SEQ ID NO:408) 

CTGACTTCCCTCAATATTGCTCCATGTGAAAGGCTTAAGGCGGTTGGTTCCGCTATTGCTGGCGATGTAAATATTACGGG 
TAGCGGGAGCTCGGTTGTCCAGCAGTGCGCCGGCGCTCCATTCCTGGGTTTTGGTCGAACCGGTGGAAGTCGACTCCACC 
TTGTAACGTATAAGGTCGCCCGCCCAGTTCTTGTCACTGGCAAAGCTGGACTGGTAGCTGTAGCGGATCAGCTTGTCGCC 
GGTTCCGTCATCCTGCAGCGCGGAAGTCATTGCTGGTTTGGAGGAGGAGGTGTTGCGCTCGGAAATCCGTGTCAGGATCT 
TATTGAAAGCCTGAACCAGAGAGTCCGGTGA 

>ORF39309c (SEQ ID NO:410) 

AGCTGCCTTGTCATTATCGCTTCCATAGCCGTTGCCGGTAACTACTGCCCATTGTCCGCTGTGCAGTCTGGCTACGGTAG 
GTTTGGAGAAGGTGTAACCAAGGTCCGAGTCGGTACTGCTATCGTATTCCCAAAGCAGCTTGACATCGTCCGGCTTGGTT 
ACATCGAGTGCGAACAGGCCGCGACCTCCAGCACCAAGCGTTCCGATCAGAACAGTGTGCCAAGCTCCATCGAAAAAGGC 
ATCGCTGACGACCGGTGTAGCGTCGACGAAATATTGGTGGGCACCGCCCTGGTAGCTGATGCCGGTAAGCTTGTTAAGCT 
TTTCGAATACTGCTGTAGGGATGAAAGCGAACTCTTCCACGCCGGTTTTGATGTTGAAACCATGCAACATGCCATCGTTG 
GATCCAACATAAACTCTAGGGCTGCGCTGGTCTGCCTCTGTCTTGAATGTGCCGTAGTCGCCGCTGGGTTCGATGGGGTT 
GGCCAGATAAGTGAGGTATTGGGCCGGTCCGACCACGGCTGGAGACGAGTGCACGATGTCCCCTAA 

>ORF38768 (SEQ ID NO:412) 

GGGACATCGTGCACTCGTCTCCAGCCGTGGTCGGACCGGCCCAATACCTCACTTATCTGGCCAACCCCATCGAACCCAGC 
GGCGACTACGGCACATTCAAGACAGAGGCAGACCAGCGCAGCCCTAGAGTTTATGTTGGATCCAACGATGGCATGTTGCA 
TGGTTTCAACATCAAAACCGGCGTGGAAGAGTTCGCTTTCATCCCTACAGCAGTATTCGAAAAGCTTAACAAGCTTACCG 
GCATCAGCTACCAGGGCGGTGCCCACCAATATTTCGTCGACGCTACACCGGTCGTCAGCGATGCCTTTTTCGATGGAGCT 
TGGCACACTGTTCTGA 

>ORF40047c (SEQ ID NO:414) 

AAGCACTTGGCCGGCAGCGAACATATCTTCGATCATCATTTCGCCTTTCTTGCTGCTATTGACCTCCAGATTCAGTCGCC 
AGCCATAGCTAGCCACATCCGAGTTCGCGGTACCGGTTGCTCCGTCTTTGTACCACTTCACCGGGTTTTGGCTAATAAGC 
CGAATATTCCTGTTCACGCTACCGAATGTGGAGTTCGCCTCTGTTGTCATGGTTTGGGCTGTGAGGGCGTTGCGGTCGAT 
GGTTGGGGTACTGTTTGCGCTTTCGCCCTTGGTCTGGCGATCCCAGATACCATAGAGCGTCATGGCTCGGCTGGTATCGG 
CCTGAGCGTCATCGTCCTCGAAGTATTTTCCTGTACCTACGATGACGATGTAGCCCTTACGGCTAGGATGGCGTACCAAG 
GTAGGCGGAGCCGTGATGGGCTGACGAGTATTGTTGTCGGCGCGAGCACGGAAAAGCGGGGCGCCGCTGAACGATACTCT 
GAAAGCAGAAGGATCTACATCTCCGGGCTTGAAGGGATTGATAGAGGTATTTGTGTCTGGGTCGTCGTTGCGGGTATTGC 
CGATCAAATCGAAGCGCCAGATATTTCCCTGCAGATCGCCAGCATAGGCGTAGTCAGCAATGCCATCGCTGTTGTTATCA 
GCCAGGCGAGGCGTCGATAG 

>ORF40560c (SEQ ID NO:416] 

CCGGCGAGTCCTGTTGTTGGACACGGTTGGGCAAGCGATATGTCTGCCCATCGACTACTACCAGACCGGCGGCAGGATGA 
ACATCCTCGACCACGCCCACATTCTCGAACGTATTCGTGGCACTCAAGGCAAAGGTTGGGCAAGCCAGAGCTAGAGCTGC 
AAGAGCTGTGGCGAGAAGACGTAAGGGGTTCATGTTCATTTCTCCTCGACGACCCGCCAGGTTTGTCGTCCGTTACTCTT 
GTCGCTGGGGTTGAAGATGATGCACTCATCACCAGTGCAAGCCTCGGATTGACGCTGTTCGTTCTGGGTAATGGCCAAGC 
CACCTAG 
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>ORF40238 (SEQID NO:418) 

GTG6CTTGGCCATTACCCAGAACGAACA6CGTCAATCCGAGGCTT6CACT66TGATGAGT6CATCATCTTCAACCCCAGC 
GACAAGAGTAACGGACGACAAACCTGGC6GGTCGTCGAGGAGAAATGAACATGAACCCCTTACGTCTTCTCGCCACAGCT 
CTTGCAGCTCTAGCTCTGGCTTGCCCAACCTTTGCCTTGAGTGCCACGAATACGTTCGAGAATGTGGGCGTGGTCGAGGA 
TGTTCATCCTGCCGCCGGTCTGGTAGTAGTCGATGGGCAGACATATCGCTTGCCCAACCGTGTCCAACAACAGGACTCGC 
CGGTCATATTCTTGGTACGTCAGGGACAGACAGTGTCTTTCTCCGGCAAACTCACCAGCGACCTGCCAGAAATCGAGTCG 
TTCTACATTATCAAGCAGGCCCCTCTCGTTCCCTTCGGATCGGAGCAGCAACAATGAAGTCGAACAGAGGCTTCACTCTC 
ATCGAGTTGATGATCGTCGTAGTAATCATCGCTATTCTTGCTGGTATCGCCTACCCCAGCTACGACGAATACGTGAAGCG 
CGGGAATCG^CCGAAGGACAGGCATTACTCAGCGAAGCAGCCGCTACTCAAGAGCGCTATTTTTCACAGAACAATACTT 
ATATCACTACCCMGCCGACATCGGCMGCTGCATC^^ 

TACAGCCTTACCGTCGATACGGTAGCCAACGACGGAGGTTATCGCCTTATCGCTAACCAGGCATTCAACGATCTTGATTG 
TGGCAACCTGACCTTGACCGCCAACGGCGAGAAAGGCCGGACTGGAAGCAAGAAGAGCGTTGCAGAATGCTGGCGCTAA 

>ORF40329 (SEQ ID NO:420) 

CGGACGACAAACCTGGCG(WTCGTCGAG(3AGAMTGAACATGAACCCCTTACGTCTTCTCGCCACAGCTCTTGCAGCTCT 
AGCTCTGGCTTGCCCAACCTTTGCCTTGAGTGCCACGAATACGTTCGAGAATGTGGGCGTGGTCGAGGATGTTCATCCTG 
CCGCCGGTCTGGTAGTAGTCGATt&KAGACATATCGOT^ 

TTGGTACGTCAGGGACAGACAGTGTCTTTCTCCGGCAAACTCACCAGCGACCTGCCAGAAATCGAGTCGTTCTACATTAT 
CAAGCAGGCCCCTCTCGTTCCCTTCGGATCGGAGCAGCAACAATGA 

>ORF40709c (SEQ ID NO:422) 

AGCCTCTGTTCGACTTCATTGTTGCTGCTCCGATCCGAAGGGAACGAGAGGGGCCTGCTTGATAATGTAGAACGACTCGA 
TTTCTGGCAGGTCGCTGGTGAGTTTGCCGGAGAAAGACACTGTCTGTCCCTGACGTACCAAGAATATGACCGGCGAGTCC 
TGTTGTTGGACACGGTTGGGCAAGCGATATGTCTGCCCATCGACTACTACCAGACCGGCGGCAGGATGAACATCCTCGAC 
CACGCCCACATTCTCGAACGTATTCGTGGCACTCAAGGCAAAGGTTGGGCAAGCCAGAGCTAG 

>ORF40507(SEQID NO:424) 

TCGATGGGCAGACATATCGCTTGCCCAACCGTGTCCAACAACAGGACTCGCCGGTCATATTCTTGGTACGTCAGGGACAG 
ACAGTGTCTTTCTCCGGCAAACTCACCAGCGACCTGCCAGAAATCGAGTCGTTCTACATTATCAAGCAGGCCCCTCTCGT 
TCCCTTCGGATCGGAGCAGCAACAATGAAGTCGAACAGAGGCTTCACTCTCATCGAGTTGATGATCGTCGTAGTAATCAT 
CGCTATTCTTGCTGGTATCGCCTACCCCAGCTACGACGAATACGTGAAGCGCGGGAATCGCACCGAAGGACAGGCATTAC 
TCAGCGAAGCAGCCGCTACTCAAGAGCGCTATTTTTCACAGAACAATACTTATATCACTACCCAAGCCGACATCGGCAAG 
CTGCATATGCGCAACACATCGGGCACCACAGTGAAGTCCTCCACAGGCAAATACAGCCTTACCGTCGATACGGTAGCCAA 
CGACGGAGGTTATCGCCTTATCGCTAA 

>ORF41275c (SEQ ID NO:426) 

GTGGGGGGCGTCGGAAGAGCAGGAACTGGAGGGACGGGAGGAGAACATTACCTTCTCGATGCCCAAGGAACTGCGGGTCA 
AGGCTTTGTAATCGGAATTTTTGCGCACCTGAAAAAGCCCGGCTTATGCCGGGCTTTGCCTTTTTCTTGTCTCGGCGCTT 
TAGCGCCAGCATTCTGCAACGCTCTTCTTGCTTCCAGTCCGGCCTTTCTCGCCGTTGGCGGTCAAGGTCAGGTTGCCACA 
ATCAAGATCGTTGAATGCCTGGTTAGCGATAAGGCGATAACCTCCGTCGTTGGCTACCGTATCGACGGTAAGGCTGTATT 
TGCCTGTGGAGGACTTCACTGTGGTGCCCGATGTGTTGCGCATATGCAGCTTGCCGATGTCGGCTTGGGTAGTGATATAA 
GTATTGTTCTGTGA 

>ORF42234c (SEQ ID NO:428) 

TCGACGTCCAGCCGGCCTGAACCGTCGGTCGCTGCGCCCTTCCCAAGCGGGGAGGGCGGTAGCAAGGTTCATTCGTCCAA 
TCACCGCGTCGCCCACGAGACCGCCATGCAAATCAAACTCGCCAATCCCCGCGGCTTCTGCGCCGGCGTGGATCGCGCCA 
TCGAGATCGTCAACCGTGCCCTCGATGTCTTCGGCCCGCCGATCTACGTGCGTCACGAGGTGGTGCACAACAAGTTCGTC 
GTGGACAACCTGCGCCAGCGCGGCGCCATCTTCGTCGAGGAACTCGATCAGGTGCCGGACAACGTCATCGTCATCTTCAG 
CGCCCACGGCGTTTCCCAGGCGGTCCGCAAGGAAGCCGAGGGGCGCGGCCTGAAGGTTTTCGACGCGACCTGCCCGCTGG 
TGACCAAGGTGCACATGGAAGTGGTGCGCTACAGCCGCGACGGCCACGAATGCGTGCTGATCGGGCATGAAGGCCACCCC 
GAGGTGGAAGGCACCATGGGCCAGTACGATGCCAGCAACGGCGGTGCCATCTACCTGGTGGAGGACGAGGCCGACGTCGC 
CGCGCTGGAGGTGCGCAAGCCCGAAGCCCTGCACTACGTGACCCAGACCACCCTGTCGATGGACGACACCTCGAAGGTCA 
TCGATGCCCTGCGCGCCAAGTTCCCGCAGATCCAGGGGCCGCGCAAGAACGACATCTGCTATGCCACCCAGAACCGCCAG 
GATGCCGTGAAGGAACTGGCCGACCAGTGCGACATGGTCCTGGTGGTGGGCAGCCCCAACAGTTCCAACTCCAACCGCCT 
GCGCGAACTCGCCGAGCGCATGGGCACGCCGGCCTACCTGATCGACGGCGCCGAGGACATGCAACGCGGCTGGTTCGACG 
GTGTGCGTCGCATCGGAATCACCGCAGGCGCCTCCGCGCCGGAAGTGCTGGTGCGCGGAGTGATCGCCCAGCTACGTGAG 
TGGGGGGCGTCGGAAGAGCAGGAACTGGAGGGACGGGAGGAGAACATTACCTTCTCGATGCCCAAGGAACTGCGGGTCAA 



GGCTTTGTAA 
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>ORF41764c (SEQ ID NO:430) 

AGGCCACCCCGAGGTGGAAGGCACCATGGGCCAGTACGATGCCAGCAACGGCGGTGCCATCTACCTGGTGGAGGACGAGG 
CCGAC6TCGCCGCGCTGGAGGTGCGCAAGCCCGAAGCCCTGCACTACGTGACCCAGACCACCCTGTCGATGGACGACACC 
TCGAAGGTCATCGATGCCCTGCGCGCCAAGTTCCCGCAGATCCAGGGGCCGCGCAAGAACGACATCTGCTATGCCACCCA 
GAACCGCCAGGATGCCGTGAAGGAACTGGCCGACCAGTGCGACATGGTCCTGGTGGTGGGCAGCCCCAACAGTTCCAACT 
CCAACCGCCTGCGCGAACTCGCCGAGCGCATGGGCACGCCGGCCTACCTGATCGACGGCGCCGAGGACATGCAACGCGGC 
TGGTTCGACGGTGTGCGTCGCATCGGAATCACCGCAGGCGCCTCCGCGCCGGAAGTGCTGGTGCGCGGAGTGATCGCCCA 
GCTACGTGA 

>ORF41284 (SEQ ID NO:432) 

CTGGGCGATCACTCCGCGCACCAGCACTTCCGGCGCGGAGGCGCCTGCGGTGATTCCGATGCGACGCACACCGTCGAACC 
AGCCGCGTTGCATGTCCTCGGCGCCGTCGATCAGGTAGGCCGGCGTGCCCATGCGCTCGGCGAGTTCGCGCAGGCGGTTG 
GAGTTGGAACTGTTGGGGCTGCCCACCACCAGGACCATGTCGCACTGGTCGGCCAGTTCCTTCACGGCATCCTGGCGGTT 
CTGGGTGGCATAGCAGATGTCGTTCTTGCGCGGCCCCTGGATCTGCGGGAACTTGGCGCGCAGGGCATCGATGACCTTCG 
AGGTGTCGTCCATCGACAGGGTGGTCTGGGTCACGTAGTGCAGGGCTTCGGGCTTGCGCACCTCCAGCGCGGCGACGTCG 
GCCTCGTCCTCCACCAGGTAGATGGCACCGCCGTTGCTGGCATCGTACTGGCCCATGGTGCCTTCCACCTCGGGGTGGCC 
TTCATGCCCGATCAGCACGCATTCGTGGCCGTCGCGGCTGTAGCGCACCACTTCCATGTGCACCTTGGTCACCAGCGGGC 
AGGTCGCGTCGAAAACCTTCAGGCCGCGCCCCTCGGCTTCCTTGCGGACCGCCTGGGAAACGCCGTGGGCGCTGAAGATG 
ACGATGACGTTGTCCGGCACCTGATCGAGTTCCTCGACGAAGATGGCGCCGCGCTGGCGCAGGTTGTCCACGACGAACTT 
GTTGTGCACCACCTCGTGACGCACGTAGATCGGCGGGCCGAAGACATCGAGGGCACGGTTGACGATCTCGATGGCGCGAT 
CCACGCCGGCGCAGAAGCCGCGGGGATTGGCGAGTTTGATTTGCATGGCGGTCTCGTGGGCGACGCGGTGATTGGACGAA 
TGAACCTTGCTACCGCCCTCCCCGCTTGGGAAGGGCGCAGCGACCGACGGTTCAGGCCGGCTGGACGTCGA 

>ORF41598 (SEQ ID NO:434) 

CCTTCGAGGTGTCGTCCATCGACAGGGTGGTCTGGGTCACGTAGTGCAGGGCTTCGGGCTTGCGCACCTCCAGCGCGGCG 
ACGTCGGCCTCGTCCTCCACCAGGTAGATGGCACCGCCGTTGCTGGCATCGTACTGGCCCATGGTGCCTTCCACCTCGGG 
GTGGCCTTCATGCCCGATCAGCACGCATTCGTGGCCGTCGCGGCTGTAGCGCACCACTTCCATGTGCACCTTGGTCACCA 
GCGGGCAGGTCGCGTCGAAAACCTTCAGGCCGCGCCCCTCGGCTTCCTTGCGGACCGCCTGGGAAACGCCGTGGGCGCTG 
A 

>ORF42172c (SEQ ID NO:436) 

CAAGGTTCATTCGTCCAATCACCGCGTCGCCCACGAGACCGCCATGO^TCAAACTCGCCAATCCCCGCGGCTTCTGCG 
CCGGCGTGGATCGCGCCATCGAGATCGTCAACCGTGCCCTCGATGTCTTCGGCCCGCCGATCTACGTGCGTCACGAGGTG 
GTGCACAACAAGTTCGTCGTGGACAACCTGCGCCAGCGCGGCGCCATCTTCGTCGAGGAACTCGATCAGGTGCCGGACAA 
CGTCATCGTCATCTTCAGCGCCCACGGCGTTTCCCAGGCGGTCCGCAAGGAAGCCGAGGGGCGCGGCCTGAAGGTTTTCG 
ACGCGACCTGCCCGCTGGTGACCAAGGTGCACATGGAAGTGGTGCGCTACAGCCGCGACGGCCACGAATGCGTGCTGATC 
GGGCATGA 



>ORF42233c (SEQ ID N0:151) 

C(3ACGTCCAGCCGGCCTGAACCGTCGGTCGCTGCGCCCTTCCCAAGCG(WGAGG(KGGTAGCAAGGTT™CGTCC^T 
CACCGCGTCGCCCACGAGACCGCCATGCAAATCAAACTCGCCAATCCCCGCGGCTTCTGCGCCGGCGTGGATCGCGCCAT 
CGAGATCGTCAACCGTGCCCTCGATGTCTTCGGCCCGCCGATCTACGTGCGTCACGAGGTGGTGCACAACAAGTTCGTCG 
TGGACAACCTGCGCCAGCGCGGCGCCATCTTCGTCGAGGAACTCGATCAGGTGCCGGACAACGTCATCGTCATCTTCAGC 
GCCCACGGCGTTTCCCAGGCGGTCCGCAAGGAAGCCGAGGGGCGCGGCCTGA 
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>ORF42233c (SEQ ID N0:151) 

CGACGTCCA6CCG6CCTGAACC6TCG6TCGCT6CGCCCTTCCCAA6CGG6GAGGGCG6TA6CAA6GTTCATTCGTCCAAT 
CACCGCGTCGCCCACGAGACCGCCATGCAAATCAAACTCGCCAATCCCCGCGGCTTCTGCGCCGGCGTGGATCGCGCCAT 
CGAGATCGTCAACCGTGCCCTCGATGTCTTCGGCCCGCCGATCTACGTGCGTCACGAGGTGGTGCACAACAAGTTCGTCG 
TGGACAACCTGCGCCAGCGCGGCGCCATCTTCGTCGAGGAACTCGATCAGGTGCCGGACAACGTCATCGTCATCTTCAGC 
GCCCACGGCGTTTCCCAGGCGGTCCGCAAGGAAGCCGAGGGGCGCGGCCTGA 
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>0RF2 (SEQ ID N0:3) 

SPIQCQ6VPGQSEPTH6CRGRHCQAP6RRREQHQYRLQRQRHQLRDDRNQQQLGPQQHPLRRRQRHPAVDEQWRGGLRR 
RLRAARCAGRSASRSATGDRL* 

>0RF3 (SEQ ID N0:5) 

RRSNMEYLGNQSLLTAAGAGIAKLLDADENNTSTVFSGNGTSFGTTGTNSNSALNSILSGGVSDIRQWMNKLYGEAFAA 
VYVQPGARVAVHLDQQLAIDYELKGRKVDYSSGAAHATADLD* 

>ORF602c (SEQ ID N0:7) 

SAWSFAEASCCGSIGRRSVCLASRSSRPRLLPIELVAPRSQTSSMLASPWGSISSLLVEHAARVSAQARPAQRRRRGLVQ 
VCCCMSGSRAVIDLAALEFIVDRQLLIEMHCDPRTWLHVDGGEGLPVQLVHPLPDVADAAGEDAVEGRVAVGSGRPEAGA 
VAAEDGTGWLVGVQELGNAGPGSRE* 

>ORF214 (SEQ ID N0:9) 

TSCTGRPSPPSTCSQVRGSQCISISNWRSTMNSRAARSITALEPLMQQQTWTNPLLRLCAGLACALTLAACSTSKEEMLP 
HGEANMLDVWERGATSSIGNSRGRLLLDARQTLRRPIDPQQDASANDQADYTRTASNEIHSQFKRLPNPDLVMYVFPHLA 
GSDPAPVPGYTTVFPFYQRVQYAMPGERTEDY* 

>ORF1242c (SEQ ID N0:11) 

SRPGRRTGQSRVRFRAPJIRSSAGLLSMPGRSASM)RGPRCPRAPVRRMRRANAHPPGASLARRAGTQPRAAGLRTMGR 
DRRGVTLRPAWRHSCSRCWAEEYPWRPVAPDSAQSLLPRPLRPALLNLRERLPVPTEAVCDRAEGFEKSPSIVLRAFARH 
GVLDSLVEGEHGGVARYRGGIAAGQVREHIHHQVGIGQSFELTVDLVAGRAGWGLVIRGGILLRIDWAPQRLPGIEEQP 
ATAVAYRAGRTSLPDVEHVGLAVGQHLLLAGGARCQGQRAGQAGAETKKGVSPSLLLHERLQSCNRPCGP* 

>ORF594 (SEQ ID N0.13) 

PGRLHPHGQQRDPQSVQTTAQSRPGDVCVPAPGRQRSRPGTGLHHRVPLLPASPVRHAGRTHGGLLMGFFQTLLRGRTQP 
QSVPADAPEDSGALDVAAAEEATERYLMLAMGIPLPNTGSKNGATQAEASRLYDHDPSFVDLLPWAEYLPDEQVMLLE 
DGRSRAAFFELVPLGTEGRDPNWMQNARDALKEALQNSFDEHETSPWIVQFYAQDEISWDNFQEQLRQYVHPRARGSAFS 
EMYLALMKHHLEGISKPGGLFVDTAVSKLPWRGQQRRVRMVVYRRIRKEDAQIRGQDPAAYLKSICERIQGGLANAGIVA 
SRMGGQEIRNWLIRWFNPHPDHLGQAEADLRRFYELVCRPDEPILQDELPLADGTDFSQNLFYRQPVSDATQGVWLFDAM 
PHRVIVVDQLNKAPLTGHFTGETLKGDGLNALFDRMPEDTLLCITMVVTPQDMLEGHLQQLSKKAVGDTQASIHTREDVA 
TVRRLIGREHKLYRGAIALFVRGRDHTQLEERCITLSNVLLGAGLVPVEPQNEVGPLNSYLRWLPSNFDPNEKRALEWYT 
QMHFAQHIANLSPIWGRTTGTGHPGFTLFNRGGAPLTFDPFNKLDRQMMGFIFGPTGSGKSASLTNLICQMLAMYLPR 
MFVAEAGNSFGLLADLAKRFGLSVHRVRLAPGSGVSLAPFADAIKLVESPDQVKVLDAEDIEASDSVQGSKADLEDDQRD 
ILGEMEIVARLMITGGEEKEDARLTRADRSAVRQAILAAARTCAAANRTVLTQDVRDALYEASRSDSTAPERRARIAEMA 
EMQMFCHGADGEHFNREGTPWPEADLTVVDFATYAREGYAAQLGIAYISLLNTVNNIAERDQFKGRPIVKITDEGHIIT 
KHPLLLPYAMITKMWRKLGAWFWLATQNIDDIPASGAPMLNMIEWWLCLNMPPDEVEKISRFRELSPAQKSMMLSARKE 
SGKFTEGVLLMGKEYLVRWPPSLYLALAMTENEEKNQRYNIMQATGCDELEAALQVAADLDKARGLPPFPIVFPDQPA 
VECQDE* 

>ORF1040 (SEQ ID N0:15) 

VPARRASDAPGGWAFARRILRTGALGHRGPRSQLDAERPGRIERSPAELLRRARNLTLDCPVLRPGRDQLGQFPGAVEAV 
RPSSSARIGLQRDVPGAHEASPGGHFEAGRTVRRHRRQQAALARTTAPRADGRLPPDPQGGCADSRTGPGGVPEIHLRAY 
PRRPGERRHRRFAHGRTGDQELVDPLVQPAPGSPRPGRGGPTSLLRTGMPSGRTDPAG* 

>ORF1640c (SEQ ID N0:17) 

VRLGLAEVIRVRVEPADQPVPDLLSAHARSDDAGVRQAALDTLADGFQVRRRVLSANLRILLADPAVDDHPHAALLSSPG 
QLADGGVDEQSARLRNALQVMLHERQVHLAEGRSSRSRMDVLPQLLLEIVPADLVLGVELDNPG* 
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>ORF2228c (SEQ ID N0:19) 

GEPAQVAVQRSDFVLRFDRHQAGAEQYVAQGDAAPLQLGMVAAAHEQSDRSAIELVLPADQASNG6HVLA6VDRGLGVTN 
GLFRELLQMPFQHVLRRHDHGDAQQRVLGHSIEQGVEAIAFERLAGEVACQRRFVQLVDHNHSVRHGIEEPYALGGIGNR 
LPIEQVLGEVSAVGQWQFILQDRFVRTAYQFVEAT* 

>ORF2068c (SEQ ID N0:21) 

SLCSRPIRRRTVATSSRWIEAWVSPTAFFESCCRCPSSMSCGVTTMVMHSSVSSGIRSNRALRPSPLSVSPVKWPVSGA 
LFNWSTTITRCGMASKSHTPWVASETGCR* 

>ORF1997 (SEQ ID NO:23) 

HPGLDPHPRGRGHRSTPDRPGAQALSRSDRSVRARPRPYPVGGTLHHPEQRTARRRPGAGRTAERSRTAEQLPALAPLKL 
RSKREASPGVVHPDDVRSAHRQPVMLGAHHRYRTPWLHAVQPWRRAVDLRPVQQAGPADECPRLHLRANWLRQVGVPDQ 
PHLPDARHVPAADVRRGSGQQLRPAGRLSQAVWPLGPPGAPRPGLRRQPGAVRGRHQAGREPRPSEGAGRRRHRGLGLGP 
GQQGRPRGRPARHPGRDGDRRPPHDYRWRREGRCAPDPCRSQRRPPGDPGGGQDLRRREPHGTDPRRARCALRGLQER* 

>ORF2558c (SEQ ID NO:25) 

VGQQAEAVARFRDEHPRQVHGEHLADEVGQGRRLAGASWPEDEAVGIHLPVQLVERVEGQRRAATVEQREARVSGTGGAP 
PDGRQVGDVLSEHHLGVPLQGSLLVWIEV* 

>ORF2929c (SEQ ID NO:27) 

SASRTSWVSmFmQVIJ^RIAWRTALRSARVRIlASSFSSPPVIMRRATISISPRMSRWSSSRSALLPWTESEASMS 
SASSTFTWSGLSTSLMASANGARLTPEPGARRTRWTERPNRLAKSASRPKLLPASATNIRGRYMASIWQMRLVRDADLPE 
PVGPKMKPWAFICRSSLLNGSKVNGAPPRLNSVKPGCPVPWRPQMGDRLAMC* 

>ORF3965c (SEQ ID NO:29) 

APVGPYQAVDWAAIHPRAALSAGRYPGDRLPSVESAAAPLSVQERISLASAGHPLRGSAGSGSGCRSGSGSANSELSFV 
LALHCRLVWEMGEGWQAARLVEIRCDLQGRLELVAAGGLKWALVLFFVFGHGQGQVETRGNHTDEVFFALGQEHALG 
ELAAFLAGREHHRLLRRRQLAEPGYLLYFVGGHVQAQPPLDHVQHRRPGGWDWDVLGGEPEPGAQFPPHLGDLHGVGQQ 
QRVLGDDVPLIGDLDDWPALELVAFGDWHGVQQRDVGDPELGGVAFARVRCEIHHGKVGLRPGRAFAIEHLAVGAHAEH 
LHGFRHFGDPRAAFWRGAIAPGGLVERIAHVLGQYRAVRGGAGPGRRQDRLADGAAIGTGQARIFLLFATGNHEAGDDLH 
LAQDVSLWLEVGLAALDRVRGLDVFGVQHLHLVGALDQLDGVRERRQADAGARGEAHPVDREAKPLG* 

>ORF3218 (SEQ ID N0:31) 

GMHHQAPAAAALRHEDHQDVAETGRLVIARHPEHRRHPSLRGADAEHDRVVVVPEHAPRRSREDIQVPRAVAGAEVDDA 
LGPQGKRQVHRGRAPGQGQRIPRPCGSPESLPGPGHDRKRRKEPALQHHASHRLRRARGGLAGRSGSRQGARPATLPHCF 
PRPTGSGVPGRMRVLNSLTQNLIDHLTQILQNPEEDALQTLRICAPVLIEELQQIQLRAVDRRDIVPQIKQLLDEWLQQH 
PQPDTAQQALIEAVDRAEILQRRQA* 

>ORF3568 (SEQ ID NO:33) 

PKTKKRTSATTSCKPPAATSSRRPCRSQRISTRRAACHPSPLFSQTNRQWSARTNESSEFADPEPDRQPDPDPAEPRRGC 
PADAKDMRSCTDRGAAADSTEGSRSPGYRPADKAALG* 

>ORF4506c (SEQ ID NO:35) 

VMFWFRTFLQSSLVQFRKVQCAARQPAPVAGRLSEDRIDSAPEGFGAALDPRALHQASLVAGRLAMHLQGKMAPNQVH 
VRMAVYPALKPRGVDLAEGALQVGVFIDRPARFRIAVEAWGWQALHQKLYPYGGCSQQDQQQPRPGQGGTLKSFGCPAA 
LQESHACLRCRISARSTASMSACWAVSGCGCCCSHSSKSCFICGTISRRSTALS* 

>ORF3973 (SEQ ID NO:37) 

GRGPRGDPTAEASVRLLKGGWAAKRFQGPALPWAGLLLVLLAASAVGVELLVKGLPANHSLYGDAKARWTINEYADLECP 
FCKVYTPRLKRWVDSHPDVNLVWRHLPLQMHGEAARHQARLVECAGIQGGAKAFWSAIDAIFAQSAGNGGGLPGGTLDFP 
ELDQARLEKCAKDNELIDSDIKLDIDIARSKGITATPTLVIRDNQTGRSVKLEGMADETTLLSAIDWLAKDL* 
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>ORF4271 (SEQ ID NO:39) 

TWF6AIFPCRCMARRPATRLAMSARGSKAAPKPSGALSMRSSLSRPAT6A6CLAAHWTFLNWTRLDWRNVRKTTNLLTQ 
ISSWTSTLHGRRALQRPRPSSSGTTRRDEA* 

>ORF4698 (SEQ ID N0:41) 

EIGEDSNIPLLVLQDALHFTWQNLDLLPIHNLYHSLVAGAGEAKPQLHCRPSIDVNALEQALHDFDHSLISVSQLHTGIM 
LPRTCRRHPYLCTWQRSITARKNTPPTS* 

>ORF5028 (SEQ ID NO:43) 

FPAALSEVILSAVCTFLEPVQTHASSSLPPWPAATNAGRWRTTGTAEQRESGRNLGHHRQGSSGLCHRIVARSVSGRPGT 
PRGATDCGLAPGSTACSSGV* 

>ORF5080 (SEQ ID NO:45) 

NRYRPMPLHHSPPGRRPPTLAVGVLLVLLSSASQAETWVITDKAHPVSATGSSRVLFLDAQEHLEEQLTAALPQDPQHAQ 
AAFKRLLQSPDGRRLQAELVKAQQDVADAWSLGVEKIPAWVDRQYWYGEPDVSRALELIAKARRSR* 

>ORF6479c (SEQ ID NO:47) 

FVSVSLLEVGTADEHLPLALAAGVGTPERPGVLPVDGLRLRPRVGKHRAVEAQGWGQLLPFPGRGIALFQLARRPVAVLG 
GCAHGEVDVELADSRGDIAGALGDDGCRLVVVGLVQEAAARIEVPPHVAGEDSTHLAQPWDQRFGVHLLGNSMPPANGVQ 
CAEKVRHQRDGGARANVPRGAGEPAERGATRMADHIRFLEAADAVLGLWCGRVIAGLGEWIRCTQRRYLGPGVAPGIRV 
AGDDCVPJIWADLDRRLHFAAMRAAEQPVTDPDDLVFEALRGKGGGDDGSAVERGRGREREAEGGGRRCQAAEVEAGHQR 
DLLALAISSRARETSGSP* 

>ORF5496 (SEQ ID NO:49) 

ANRQGQEVALMTSLNLRRLAAAAATFSLSFTASAAINSAAIVSSTLSPQCLEYKWGICYWLLCGPHGCKVKTSVKVRHY 
VPDAWSSYANTGSNPWTEVSALGTPNPLAQAGNDATTNYKAENSIGRFKEADVIGHPGGATFSRFASASGYVCPGATVP 
LVPYFLSTLDAIGWRHGIPEQVYPEALVPGLREVGGIFSGDMWGNLYPRSGFLHQTDDYKTAAVIAQRAGDITTRIGQLH 
VYLPMRAAPKDGYHPAGELKEGDASTGKWQELTPSLSLNCAVFPNSGPKTQAVDGEHAWALWRPYSCCQRKGQMFICSTD 
FQ* 

>ORF5840 (SEQ ID N0:51) 

RDHKLQGREQHRPLQGSGCDRPSWWRHVQPVRQRLWVRLPWRHRPAGAVLSQHTGRHWLAAWNSRAGVPRSVGPRAARGG 
MLLRRHVGEPLSAQRLPAPDRRLQDGSRHRPARRRYHHANRPAPRLPPHARSPQGRLLAGGRAERGRCLDREMAGADPI 
PEPQLRGVSQLWAEDASRRRGARLGALASLLLLPAQGADVHLQYRLPIRTRRRIHRNNITSVALMHLLAAQLAQADDPIN 
VSKTGTVLSDEVLYSIGGGSAVSMGSAGQMDSIGVGFGWNNDMMCGNMNLSTTLENQLNGATQGFQNIMGSVIQNATGAV 
MSLPALIIQRANPQLYNLITNGILQARIDYDRSKGTCKTIAEKMADIAGEQTGWGKIAEGQALGATLASDGKDAVSALEA 
VEKKGGNDGVTWVGGDKAGGSGQKPIRIViroVTRAGmLTSRSVNDSSSVPSATOTGLVCmSSPQEAAAFATRVLG 
EQQQQTCEGCQKTVTAAGVGLTPLIQETYDKKLQSLQELLSKSKPLTAENLAAAGTDALPITRGVIEALRDERDQDVLAR 
RLASDVSLMDVLSKALLLQRLMFAGAKEPNVAANGLATQAVDQQTSLLQQEISNLKTELELRRELASNSPMRVIERGQQR 
ASGSSGVFESAPDADRLDRLQAPSAAGGKSGGRP* 

>ORF5899 (SEQ ID NO:53) 

SAILVAPRSAGSPAPLGTFALAPPSRWCRTFSAHWTPLAGGMEFPSRCTPKRWSQGCARWVESSPATCGGTSIRAAASCT 
RPTTTRRQPSSPSAPAISPRESASSTSTSPCAQPPRTATGRRAS* 

>ORF6325 (SEQ ID NO:55) 

ASTARCFPTLGRRRKPSTGSTPGRSGVPTPAASARGRCSSAVPTSNKDTETNHANEHHLGRANVAARSATCPGRRPDQRV 
QDRHGAQRRGPLQHWRRQCGEHGQRRPDGLDRRRLRLEQRHDVRKHEPEHHPGEPAQRCHTGFPEHHGLSHPERDRRGHV 
AAGVDHPAREPSALQPDHQWHPAGADRLRPLERDLQNDRRKDG* 
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>ORF7567c (SEQ ID NO:57) 

QCLAEHVHQGDIGRQAARQDVLVTLVAQRLDDAAGNWQSIGAGRSQVLCSQWFALRQQLLQRLELLWGLLDQRGEADAS 
SRHRLLAAFAGLLLLLPQYPGGECGGLLGGGPSVADQAWASGGRHARRIIHRAAGQQWARPGHWDDANGLLAGAAGL 
VSTNPGYAIVAAFLLHCFE66YGVFPVRGQCGAQGLAF6DFPPAGLLASDVSHLF6DRFASPFRAWVDPRLQDAI6DQV 
VELRVRALDDQRRQRHDRAGRVLDD* 

>ORF7180 (SEQ ID NO:59) 

FVERAFRHLQQRPGLQHLVLPPGGRRIRHPGTGGATATDLRRLPEDGDGCWRRPHPADPGDLRQEAPVAAGAAVEEQTTD 
CREPGCGRHRCSANYPRRHRGAARRA* 

>ORF7501 (SEQ ID N0:61) 

PGRPGAPPGVRCLPDGRAQQGTATAAPDVRRRQGAQRRRQRPGHPSRRSADQPPAAGDLQSQDRTGTPSRVGQQLPHAGH 
RARATTRLRVQWRVRVGARCRSPRSPAGPLCRRRQVGRETVMADTLTTRKLLGQLLVGVLIVIGLAWGTLLSLFALNHF 
GGIQGLEAWRQSNYWSLFAWRALLYCALAIAWFRQRKELSAHERQRIRRIEILVLLLVLLIEFSKAYFRTGGAA* 

>ORF7584 (SEQ ID NO:63) 

CSPAPRSPTSPPTAWPPKPSISRPASCSRRSPISRPNWNSVASWPATPPCGSSSAGNNAPQGPVACSSRRPMPIASIACR 
PPLPPAASREGDRDGRYAHHPKASRSATGRSADRHRTGSGRYAAQSLRPEPLRWHPGPGGLAAKQLLELVRLAGAAVLRP 
GHRLVPAAQGTERA* 

>ORF8208c (SEQ ID NO:65) 

RSCCASRAEVGFAEFDEQDQQQHQDLDPPNALPLMRAQFLALPEPGDGQGAVQQRPPGEQAPWALPPGLQALDATEWQ 
GEETEQRTDHCQSDDDQHSDQ* 

>ORF8109 (SEQ ID NO:67) 

AAMSADRDPGAVAGPMRIQQSLLPHGRRSMTFMTNDYLEYYLTLLGWIIMGIWNMISDTGLFAVPFAAIVMREWLKV 
RGEGADEGinCGVLSLARIETHIYVGYIWALAGIPVVlWSFDTIEFDQTRAQQCQYNLPAPADTGWSSSFSSIjAGKSAQM 
PLMAMMHALSKGFTSGAIMIPCGTDLRQMRMEVDNTRVNNPLLAQEIADFSRDCYGPSRARLFMRQPDLGSVAEDNKA 
LQDLNWIGSRFLLNTPGYYDTDYSKSPRQSWPYNATRDAGLPQVGGGGGYPTCKQWWADSGIGLRDRIKDQVDPDLMTSF 
LKWAKWLNQDEWEAVIRQVISPSSQVKGNVnDYGGQVGGTVWNGIARTAGTFGVAVGSLAYFPAMDMVRQALPMVMSF 
LKMMVICIPMVLVIGTYQLKVAMTMTWFFAMMFVDFWFQLARYIDSTILDAFYGSGSPHLSFNPVMGLNTATQDAILN 
FVMGSMFIVLPLLWMTAIGWSGIQAGSVLNGLSRGTEGVQAAGKEAGNRVKNAV* 

>ORF9005c (SEQ ID NO:69) 

VSPPLLAGWVAATTAHLRQAGIAGGWGPRLTGTLRWGWVPRGVQQESGADPVQVLQRLWLGDGAQVGLPHEQPRTG 
RPVAVSGKISDFLCQQRIVHARVVHFHSHLPQIRAARNGRDGAAGEALGQGVHHRPPERHLRTLAGQAAEGARPAGVRRC 
RQIVLALLGASLVELDGVEAHVDDRDPRQGDHDVADVDMRLDAGERQHSLVALVGAFPTNFQPFAHHDGRERHREQASIR 
DHVPDPWDDPAEEGEVILQWIGHEGHAAPPVRK* 

>ORF8222 (SEQ ID N0:71) 

LPGVLPHPPRLDHQQRDLEHDLGYWPVRGAVRGHRDARMAESSWGRRRRGQQGSAVSRPHRDAYLRRLHRGRPGGDPGRQ 
RELRHHRVRPDSRPAVPIQSAGTGGHRLVELLQQPGRQECADAALVGDDARPVQGLHQRRHRGHSVRHGSAADANGSGQH 
AREQSAAGTRNR* 

>ORF8755c (SEQ ID NO:73] 

QSLEKSAISCASSGLFTRVLSTSIRICRRSVPHGMAAMAPLVKPLDRACIIAHQSGICALLPARLLKELDQPVSAGAGRL 
YWHCWARVWSNSMVSKLTLTTGI PARATTM* 

>ORF9431c (SEQ ID NO:75) 

LKPEVDEHHRKEDDRHRHGNFQLIGADDQDHRNADDHCHLQERHHHRQCLADHIHRREVCQAAHRNAEGSCGSRDAVPHG 
AAHLPAVIGVDVTLDLAGG* 
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>ORF9158 (SEQ ID NO:77) 

RLHRLRRAG6RHRVERHRENRRNLRRC6GQL6ILPGDGY6PP6TADGDWPEDGNGHLHSDGPGHRHLSTESCHDDDGRL 
LCDDVRRLLVSVSQIYRQHDT* 

>ORF10125c (SEQ ID NO:79) 

VIAGCLPLGARRIMNMJmGFASRIGFGLGMLVRFCLHDRRPALRWVKRVSLFLLVALVVSQNFMWLAGVSMTLLCVF 
LVGFALVKGDISVSKGSPSRDVSTMTSQAETESVAELFDYQAAHHYRD* 

>ORF9770 (SEQ ID N0:81) 

SNSSATDSVSACEVIVETSRLGDPLETEMSPLTKANPTRKTHSRVIDTPASHIKFCDTTRATNKNRLTRLTQRRAGRRSC 
RQKRTSIPRPKPIREAKPLLVCAFIINLLAPKGRHPAITYTPKKMIWQALWHIMPLAICRLEYLMATRNVVLPDPLEQDI 
NELVETGRYQNRSEVIRAGLRLLLQQEAQIAKLETLRNATSSGLMQLERGEYDEITSDELAQYLDELGNQASH* 

>ORF9991 (SEQ ID NO:83) 

SWTAIMQAETHKHTQTKTDPGGKAFVGVRVHHQSPGSQREASCYHLYAEKDDLASIMAYYATSYLPTGVPHGNAKRRPSR 
SAGAGYQRAGGDRPLSESQRSHPGRLAPAAATGSPDRQARNPPQRNIQWADATGARRVRRDHQRRTGPIPRRARQPGEPL 
KHGQVPHLS* 

>ORF10765c (SEQ ID NO:85) 

HLVCRHPVEDEVPGPNNLTDIGHRVAVNEVDAAQASSQFFTADAAYLLWVCRNCFQRRPYECLVTSARGIAEVWGEAQD 
IDDVRLGIMRDAVLGHASVARLVAELVEVLGQFVAGDLWLAALQLHQPTGCCVAEGFELGYLGFLLQQQAQACPDDFAA 
ILIAAGLHQLVDILLQRIGKDDVSRCHEVLQSADS* 

>ORF10475 (SEQ ID NO:87) 

SMAKYRISHDAQADIVDILRFTHNHFGDAARRRYQALIGAALEAVATDPQQVGSISREELGAGLRSIHLVYCHSMPNVGK 
WRPRHFVFYRVATDQVLEWRVLHDAMDVDQHLPQR* 

>ORF11095c (SEQ ID NO:89) 

SRMQAWSTNANAWSGGMQSSGQATAIAHQPWGTCWWMFTRSLFVFAGANAALSAFRQALSGRAFTLVMHSLRPSSPFPL 
WAICSCYSCSSLGQVLIHIHGWKHANHL* 

>ORR1264 (SEQ ID N0:91) 

TAVRRDLLKLMGCTHIEADYIGGLRCSTAPEGTWVAHGFHGPIVDVIDDSAGFFSTHRLALHYPAQCGLAVDQAIPRTAI 
HVASPLMH7CIGKVWI SAWMC* 
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>ORF11738 (SEQ ID NO:93) 

EEVIMKLQAYRLQNYRRLRDWIELDDEISIFVGANNSGKTSAVQGLYSMLRGEVKKFELFDFSAALWAEIDAVGRTPP6 
DEDAPKRLPSILLDLWFRVGEDDLATAMSLLPSTEWDGKCVGIRVAFEPRDAHELVWKFHELHEKANNAAVALAAKRKM 
GEQAVEAGAEDAAAWADAGEYKPWPESLTKYLTKELSKEYTFRYYVLDERAFVGYQAREADYEPLPLGKEPGGAAILKS 
LVRVDFLRAQRHLDDPDAGSSDRAESLSRRLSRFYHRNLEKRGDDHAALKALDTSEKELNFHLREVFNDTLTRLAKLGYP 
GVMPEIVimLDPTmGQDAmYVIPGVASAQLPDSYNGLGFmVYMWELU)LHEQWKAEDDKRAPLHLVFIEE 
PMLHAQIQQVFIRmRLLEDAlTOHATLFHTQLVITTHSPHILYERGFSPIRYFRRViroQLGHHTDVmSLFKTGAS 
DAPAREFLQRYLKLTHCDLFFSDAVILVEGNVERLLLPAMIELVAKRLRSSALTILEVGGAFAHRFQELIAFVGLTTLVI 
TDLDSVTVKTDAEKAAAQGAGAEGAVDGDDEDEDDDLKPFELEDDDEAEPSGKKKSKKRGSTCHAHVEGAVTSNQTLISW 
IPKKRSMAELWEVTAEQKTLSLAEDSSAGVRVAYQTKVSVTVGATTSQLCGRTLEEAFGLENADWCQAEANRSVGLKLKR 
APSSPEELAEKLHDRWGKNFDKTRFALEVLASGPLNGWKVPAYIAEGLAWLEAKVAHELEADAAIATEVATIEPTTADV 
VAIIVDPGQTA* 

>ORF12348c (SEQ ID NO:95) 

RKVYSLLSSFVRYFVRLSGQGLYSPASATTAAASSAPASTACSPAALRLAASATAALLAFSCSSWNFQTSSWASRGSNAT 
RIPTHLPSESVLGSSDIAVARSSSPTRNQRSKSMDGNLLGASSSPGGVLPTASISAHSAALKSKSSNFFTSPRSIEYRPW 
TADVFPLLLAPTKIEISSSSSMTTSRSRR* 

>ORF12314c (SEQ ID NO:97) 

GTSSGFLAKACTRRHRPPPQPRLPRPPPQLAPRRPCAWPQARQLRCWPSHWHGTSRRARGHPEARTLPGSRRTCRPTQC 
SAMTSQWRGRLHRRGTRDPRVWTVTFWAHPHRQGASCRPHRSRPTAPH* 

>ORF13156c (SEQ ID NO:99) 

RQIAHIRVMAQLWDAAEVPNGRESAFIEDVRGVRGDDELRVEQSRVIVSILKEAQNVPDEDLLDLRVQMRLRLLNEDQM 
KRSSLVILGFPLLVQVEQLNHHVDQILEPQAIVAVWQLGGSYARDHWNLGVLPQDSGRIQGRPNHDLRIVDARIAELGQ 
AREGVIEDFLQVEVQLLLRGI* 

>ORF12795 (SEQ ID NO:101) 

LPPNCQTATMAWGSRIWSTWWLSCSTCTSSGKPRMTSELRFIWSSLRSLRRICTRRSSRSSSGTFCASLRMLTITRLCST 
RSSSSPRTPRTSSMNADSRPFGTSAASTTSWAITRMCAICRYSKRARPTLQRANSCSGI* 

>ORF13755c (SEQ ID N0:211) 

ATRTPALESSASDSVFCSAVTSQSSAIDRFFGIQLMRVWLDVTAPSTCAWQVLPRFLDFFLPLGSASSSSSSSKGFRSSS 
SSSS SP STAPSAPAPCAMFSASVLTVTLSRSVMTSVVSPTKAISSWNRCANAPPTSRM7RAEERRRLATNS I IAGRSRR 
STLPSTNITASEKKRSQCVSFRYRCRNSRAGASDAPVLNSDRLRTSV* 

>ORF13795c (SEQ ID NO:213) 

CRRTHRHRKLGLVSYPNPSAGILSQRQRLLLRRDFPELCHRPLLRDPADEGLVGRDGTFHVCMAGAATLLGLLLATRFCF 
VWFKLEGLQVWLVLVISVNGALSACALRGGLLGVRLDRHAVQIRDDQCCEPNEGDQLLEPMRERTTDFKDG* 

>ORF14727c (SEQ ID NO:215) 

QEVGELKDVLVAKYALGWTAHAWERPDAGHSLQASDISLLVGLVPVARGLPDTRAVILELFFKFGDPPADWLQPNLD 
VGLERLCNRPVKAVDGRDMNQRVIVDVRQDFIGGKVGIRDARDYLLPRPHAGAVLRDHPVQRLDEGGGLAGTCASTNHEG 
LRRRRYNACVDLAVGVGIWAINSSAHAVCPGSTMIATTSAWGSIVATSVAMAASASSSWATLASSQAKPSAMYAGTFQP 
LSGPLASTSSAKRVLSKFLPTTLSCNFSASSSGLLGARLSLRPTDRFASAWHQSAFSRPKASSSVRPQSCDWAPTVTET 
LVW* 

>ORF13779 (SEQ ID NO:217) 

RWVRRHHSSAAAHLRRPLVLRTRTGARLRQTGRSASSSSAHRAALKSWLRSYTIGWSARTSTRPALRWRYSQAGRSMAGR 
FPRTSPRAWPGSKPKWPTSLRRMLPSPPRSRLLSRLQPMLSLSLLTRGRRHEQTN* 
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>ORFI4293c (SEQ ID NO:219) 

GRWSCRNLRQHEPRRAAEASLQCMRGSRGRRRYLGYQFVCSCRLPRVNNDSDNIGCSRLNSRDLGGDGSIRLKLVGHFGF 
EPGQALGDVRGNLPAIERPACEYLQRKAGLVEVLADHPIV* 

>ORF14155 (SEQ ID NO:221) 

PGADGMSRRIDSPDTDADREIHACIVATPPQPFWRAGAGSGKTTSLIKALDWVISEHGASMRARKQIVACITYTDLATN 
EILADVNDDPLVHVSTIHSFYWSIMTFQADIKVWLQNDIRRRISELEEEFEnSSRVRQTTRDRNKADQERYVRSLEAV 
AGVRTFNYGVGSDYAKGILGHEDILQLADFLLQNRPLFRRWALSYPFVFIDESQDTFPGWKSFKEVEAQMQGKFCLGF 
FGDPMQSIFMRGAGDIQLEDHWRAITKPENFRCAKQILDVANAVRAQGDGMEQVRGLHERVDGNLKLVEGSARMFVLPNT 
LNRTEAIMmWSSATNlTOEGWTTPDIAVKILVIVHRMAAMLGFGGIYSALNDKTSDAMKQGMQDGTGWPVRPFLSFA 
LPIVAAVKAGNEFAAMSLLREFSPRLAPAALTGRRAADVLRELHAAASRLVAMLDEAGTTIGDIALHLCDTGLFEFDERY 
ARVLGFVRDIADTAQEPEAADAVPAEGLSLDATMAKFFNCSAQELWPYERYVSEGSPYATQHGVKGAQFERVMVVMDEEE 
SDYRTYNYERVFASAEARAADRARALDGDETOSRTLRLLYVCCTRAQRGLVLAFFVADPATTLENWASGILPRSAVFT 
QEVLVGWP* 

>ORF14360 (SEQ ID NO:223) 

SRASRIPTLPPMKSWRTSTMTRWFMSRPSTAFTGLLQRRSRPTSRFGCRTTSAGGSPNLKKSSRITARVSGRPRATGTRP 
TKSDMSEAWRLWPASGRSTTAWAVTTPRAYLATRTSFSSPTSCYKTARCSDGSWR* 

>ORF15342c (SEQ ID NO:225) 

EGSNGPTGAVLHPLLHGIRRLWQRRVDAAEAKPVCGHAVHDDKNLDCDVWGCPTLWVRRRAPRSDSCQSLGSVQRVRQ 
DEHPGRPLHQLEVPIDPLVQPADLLHAIALRAHGIGDVKDLLGAAKVLRLRDGPPMILKLDVPCASHEDRLHRVAEKTKA 
ELALHLGFHFLERLHYTRERVLTLIDKHERVAQRHDPSEQRAVL* 

>ORR5260c (SEQ ID NO:227) 

MPPKPSRFAAMRCTMTRILTAMSGWQPSSLFVAELHARTLAKASVRFSVFGKTNIRADPSTSLRFPSTLSCSPRTCSMP 
SPCARTALATSRICLAQRKFSGFVMARQ* 

>ORF14991 (SEQ ID NO:229) 

RRQCRARAGRWHGASPRAAREGRWEPQAGGGVGPDVRLAEHAEPNRGFGKSPSVELGDEQRRGLDNPRHRSQDSCHRAPE 
GRKPAWLRRHLLGAERQDVGCHEARDAGRHRLARSTLPKFCATDRCSCEGRQ* 

>ORF15590c (SEQ ID NO:231) 

RSSNSKRPVSQRWRAMSPMVVPASSSMATSLDAAACSSRNTSAARRPVRAAGARRGLNSRSRLIAANSLPAFTAATIGSA 
KLRKGRTGQPVPSCIPCFMASDVLSFSAE* 

>ORF15675c (SEQ ID NO:233) 

SFGRNCISSLGLLSGVSNIPDKPKNTRIALVELKKTRVTEMESYVTNGGPCLVQHGDKPRRSSVQLSQYIRGTSAGQSRR 
RQARAEFPEQAHRRELIAGLHSCNDR* 

>ORF16405 (SEQ ID NO:235) 

IDSLRKCVGSLEKCCFACKEIIHVHAIRCRQCGESQGWRRFMSSPTSWALVLSLLSIAATKPVERLFDAQRAELQISIT 
GGDYKAAQLMLTNNGSKPATLVSFEITSKATTNTKTWFLVSNTDGEILEPGKTYKIRASTDESIPKIVEAERRTILKSQY 
ALADNCELTAKYIEATGQKWRVQPFMCDTPPEKGGLPPGKPGIPIWYLGQE* 
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>ORF16925 (SEQ ID NO:237) 

RPRGRRLCVCNRSCATHLLKRVACPLVNLAYPFGTLVKNDVFMPPWALTPIKQSCVRSSNTSLAQLNDCYVY6CCRYVIP 
WPYAYEVHSESVQWTIFLLGVDCSGKVIYFRNTARVGPFLAASiyRPWYGSDALVLHFTK* 

>ORF17793c (SEQ ID NO:239) 

AKMIVIDKNLEHLVAQCAICEKTLFDEFSLKIQLGHTYYEPKSLPASASIVYGSHPAPSTFFLEPKEIQQNLVLKSGEQV 

ITCSKHRYKIPLDYFGLVQTKGTLARLFVQVTCNDGQVEPGFDGYVTLEIVNMSPWTIEIPAVSDIAQLYLVKCSTSASE 
PYHGRYMDAAKKGPTLAVFRK* 

>0RF1 8548c (SEQ ID NO:241) 

RTMAGWPRLAAQGRRTNLMSVLQIKGRTTKSHTDFDAASYSSNSLILTDAGDERIEEFSLELSVGEGWSDNYSGNDKNLW 
RIVDGmRGHDSVWEAAEEIKVPHNRYGIVLPTGSLFLSRGVL^ 

QQNAQPSM^ 

>ORF17875 (SEQ ID NO:243) 

TAYIIREDTRELPIKFSVHPTHM7GLSENHFFNRARRVMGSVDISLPRLMGLWVCVDSVEKKITEPSFSPLVRQTFLLVV 
LNILSLSLPSNAGSTFAEATSTPRERKRLPVGRTMPYRLCGTLISSAASTTTES* 
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>ORF18479 (SEQ ID NO:245) 

SVTHSSDLSFVL6LRDAATLPLSFIPADIPGYRLKDDVRKACTNLNFKRLAVIVGERERHRPYITWRQHTGTERYPASEQ 
RASRKKKRRQIFRQIEFFHGARQISLARFHDEAVIRVCEHDLAGRGASRRFSQASTPYCQAREACESEVKSNAFRGGQLT 
VGKVLD* 

>ORF19027c (SEQ ID NO:247) 

MIYSPHSLLKLVRDGKLIKHLAHRELnPEGVGFDLRLAGLSRLTVGGGSLRESTRRTPASEWLADPDDCFVMEPGKTY 
LASTMEEFDLPEDLAALFFPRSTLFRSGITFSSSVLPPGYVGPMTFALTNNHSEAFEIQIGARFAHVIFQAVSGDIGRYK 
GQWQGGRVSQPKDEGQI* 

>ORF19305 (SEQ ID NO:249) 

WPFSACRLFGMTGQVGCKRWSAPMQLGGHVRCNYAVEPGPVPPKQSIRPRWHIANKIPFPATVVLSLLPALIWRKSPLHE 
SSWSLPCFNSFPGYPGSRPPPQQPKLPQGDSSFL* 

>ORF19519 (SEQ ID NO:251) 

SGGKARSMNRHGASHVSTPFLDIQEAVPHPKNQSCPRGIHPSSEQHGTARHASPPAATGEHLAARLAIQAAIRGDLPAAT 
GQLCRAGPAAPCFGKSSPCPSRRDDRSRPGDRGLRTQGTADLPAPDRRSAGVTVSPG* 
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>ORF19544 (SEQ ID NO:253) 

IVMEPPMFQLLSWISRKPSPTPTTKAAPGGFILPLSSMELLGTPRRRQLLENIWQRASLSKQQFEEIYRRPLANYAELVQ 
QLPASENHHHAHPGGMIDHGLEIVAYALKVRQTYLLPIGAAPESQSAQAEAWSAAAAYGALAHDIGKIWDLQVELQDGS 
TWHPWNGPINQPYRFKYVKSREYQLHGAASALLIHQLLPRTALDWLSRFPELWAQLIYLFAGQYEHAGILGEIIVKADQA 
SVAQELGGNPDRALAAPKQSLQRQLADGLRFLVKDKFKLNQPSGPSDGWLTQDALWLVSKPAADQLRAYLLAQ6IDGVPS 
SNAPFFSMLQDQAVIQTNAEDKAIWTATVDNGAGWRNKFTLLKIAPALIWTDAAERPSPYSGSLVVEDGTASTEKPETTC 
EIPNGPAEQQQAPETKMMLHQPAPSVAKPANETQAIAKPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRG 
EENLQQPLGTKEPTDCAPEAIEDVFMPSRSTDLGQGFVGWMSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQE 
HPVLEKLAQMETTGWKLVQRAFEKQGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDA 
EGGVE* 

>ORF20008 (SEQ ID NO:255) 

ATGRQHLAPLERTDQPAIPLQVREVPRIPAPRRCLSTSHPPTATAHCTRLAQSLSRAVGSIDLPVRWAVRARRDPRRDHR 
EGRPGLSCTGARRQSGSSSGCTEAVAAAAVGRRPSLLGEGQVQVEST* 

>ORF20623c (SEQ ID NO:257) 

RELVSPSSTVVYRGRPNGLVLGICLDDGLVLEHAEERRVRGGHPIDTLGQQVGSQLIGSRLAHQPECVLGQPSIRRAARL 
IQLELVLHQEAKAVCQLPLQRLLRCSQSSIRIAS* 

>ORF21210c (SEQ ID NO:259) 

RLKIPGVTSMAVPSTVCTKALVSLMNRRRAAMPDFIQPTNPCPRSVLLLGINTSSIASGAQSVGSLVPSGCCRFSSPRVF 
VGEAAGESWLVSSSSSGELILPRSAYKSSVSS* 

>ORF21493c (SEQ ID NO:261) 

AAAASFHTSFGIGDDREAWWQRLLREQQFGILEQVGLELFRLARTRNLDGPDVQVFTGLPMKPLFFERALHQLPAGRLL 
GLGQFFKHRMLLDIALENSWRDQHGGPVYGMHQSLGWDEQATGRDARFHPTNESLSQISTSARHKYIFNCFRSAICWLL 
GP* 

>ORF21333 (SEQ ID NO:263) 

TSGPSRFLVLARRKSSRPTCSRIPNCCSLSSLWTTQASRSSPMPKEVWNDAAAAHRGVHLRARSPGSQREDLPRRDQGAA 
QALRSYGNRTGRGPPGCPGMATQGTGTRPVQAELEHVLESSADDLGLCHRA* 

>ORF22074c (SEQ ID NO:265) 

VRHKPLRYMYELFVSLLCLAADEDQLIFPVDVALADAQQRIQPDAGEVEDLKRAKPEPGGRDPCAFAGAALCAVLHVEP 
IAGEQDGFGGDGFACSPGGNDGGLSERVDLGVRHQLMLDGIAPDRPQMIRVRVPAPLGQALFQYLASPSQDSPWHVLYG 
CRRTEVLEQRLGRGAVDLRAGFPEIVREDVLLGELLRRHSTPPSASVMTVRLGLSRGCSGNSNLGSWSR* 



>ORF21421 (SEQ ID NO:267) 

AASGQPKPHGHHRCRRRCGMTPQQLTEEYIFAHDLREASAKIYRAATKALLKHFGPTATVQDVDHRAVLGWRRKVLEQGL 
SKRSMTYSlffiLRTIWGYAIEHELVTESQWPFRKTWIPPRRASKTVMEAILLARmNMQDGAERCTGERARITPAW 
FWLCTFEVFYFTGIRLNALLCIRKRDIDWENQLILIRGETEKTHKEFWPITEGLVPHLSRLLQEADRAGFADDDQLFNV 
MFSPHYKSKVMSDQVEAMYRKLTEKVGVRMTPHRFRHTLATDLMKAPERNIHLTKCLLNHSNIQTTMSYIEADYDHMR 
AVLHARSLAQGALENVRKVDYSGSPQASAKPKPCGQPLARVSEAPPPEARTEPAEPREHTPGTGIQGGPTAWEADALPQP 
PDTFEPSVLFTLMAQNLSNRAASASAAPAATSGSGGWGSAARSNLA* 
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>ORF20008 (SEQID NO:255) 

AT6RQHLAPLERTDQPAIPLQVREVPRIPAPRRCLSTSHPPTATAHCTRLAQSLSRAVGSIDLPVRWAVRARRDPRRDHR 
EGRPGLSCTGARRQSGSSSGCTEAVAAAAVGRRPSLLGEGQVQVEST* 

>ORF20623c (SEQ ID NO:257) 

RELVSPSSTWYRGRPNGLVLGICLDDGLVLEHAEERRVRGGHPIDTLGQQVGSQLIGSRLAHQPECVLGQPSIRRAARL 
IQLELVLHQEAKAVCQLPLQRLLRCSQSSIRIAS* 

>ORF21210c (SEQ ID NO:259) 

RLKIPGVTSMAVPSTVCTKALVSLMNRRRAAMPDFIQPTNPCPRSVLLLGINTSSIASGAQSVGSLVPSGCCRFSSPRVF 
VGEAAGESWLVSSSSSGELILPRSAYKSSVSS* 

>ORF21493c (SEQ ID NO:261) 

AAAASFHTSFGIGDDREAWWQRLLREQQFGILEQVGLELFRLARTRNLDGPDVQVFTGLPMKPLFFERALHQLPAGRLL 
GLGQFFKHRMLLDIALENSWRDQHGGPVYGMHQSLGWDEQATGRDARFHPTNESLSQISTSARHKYIFNCFRSAICWLL 
GP* 

>ORF21333 (SEQ ID NO:263) 

TSGPSRFLVLARRKSSRPTCSRIPNCCSLSSLWTTQASRSSPMPKEVWNDAAAAHRGVHLRARSPGSQREDLPRRDQGAA 
QALRSYGNRTGRGPPGCPGMATQGTGTRPVQAELEHVLESSADDLGLCHRA* 

>ORF22074c (SEQ ID NO:265) 

VRHKPLRYWHYELFVSLLCLAADEDQLIFPVDVALADAQQRIQPDAGEVEDLKRAKPEPGGRDPCAFAGAALCAVLHVEP 
IAGEQDGFGGDGFACSPGGNDGGLSERVDLGVRHQLMLDGIAPDRPQMIRVRVPAPLGQALFQYLASPSQDSPWHVLYG 
CRRTEVLEQRLGRGAVDLRAGFPEIVREDVLLGELLRRHSTPPSASVMTVRLGLSRGCSGNSNLGSWSR* 



>ORF21421 (SEQ ID NO:267) 

AASGQPKPHGHHRCRRRCGMTPQQLTEEYIFAHDLREASAKIYRAATKALLKHFGPTATVQDVDHRAVLGWRRKVLEQGL 
SKRSWiraYSNHLRTIWGYAIEHELVTHSQWPFRKTTVIPPRRASKTVMEAILLARlMiMQDGAERCTGERARITPA^ 
FWLCTFEVFYFTGIRLNALLCIRKRDIDWENQLILIRGETEKTHKEFWPITEGLVPHLSRLLQEADRAGFADDDQLFNV 
NRFSPHYKSMSDQVEmRKLTEKVGVmPHRFRHTLATDLMKAPERNIHLTKCLLNHSNIQTTMSYIEADYDHMR 
AVLHARSLAQGALENVRKVDYSGSPQASAKPKPCGQPLARVSEAPPPEARTEPAEPREHTPGTGIQGGPTAWEADALPQP 
PDTFEPSVLFTLMAQNLSNRAASASAAPAATSGSGGWGSAARSNLA* 

>ORF22608c (SEQ ID NO:269) 

RICFPRGWTSLNACPWRVLPWFCRLCPGLRWRRFniSSERLPAWLRFGRGLRGAAVIHLPDILQRALGQASSMQHGTHVI 
WGLDVAHRGLDIRVVEQALREVNVPLGCLHQVGGQGVPEWRGHPHPNLLGQLPVHGFDLVGVHHLALVVR* 

>ORF22626 (SEQ ID NO:271) 

HLRTKRAVHSDGSKLIEPCRLGIRGSRCNKRIRRMGICRPKQSRLAIPVLRAGYRTKGSRAFQQIVRPVGKIWNYRERLD 
SSAGMLAEPAQFQGQYHSTLCNR* 
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>ORF23228 (SEQ ID NO.273) 

RDSNSRHPAPKAGALPDCAIPRLEF6SATWTRTRDPMINSHLLYRLSYRGTSFFQPWTLPVLLDSRLR6APFY6CARACQ 
PSDPKSFSSFSTSDKTALPLHAAALSRLPDAHEKAPPKRGFPCPPPKRSGEDDLVAFHLRRDT6TRREFAGQDQLRQRVL 
DPALDGPLQRACAIDRVEADGNQLVQRLLAQFQAQLALGQALAQATELDLGDAGDLLASQRLEHHHFVDPVDEFRTEVRI 
DRVHHCGTLRLAVAGQLLDLRRTEVGGHHHHGVAEVHRTPVTVGQASVLEHLEENVEYIRMGLLHLVQQHHRVGLAADRL 
GQVAAFLEADVARRRADQAGHRVFLHELGHIYPHQRLLGIEEELGQRLAQLGLAHPGRAEEEERAARPVRIGEAGARTAH 
GVGHGDYRLVLADHSPMQLLLHAQQLLALALEHLRHRDTGPLGIfflFGDFLVGHLVAQQLVLGLAVLVDHLQAAFQVRDGL 
VLDARHALEVALAPRRLHLLLGLLDLLLDLRRALHLGLLGLPDLLEVGVFALELDDILLQLGQALPGGFWFLLQRLALD 
LQLDQATVETIQFLRLGVDLHADAAGGLVDQVDGLVRQLPIGDVAVRQLGRGDDRAVGDAHPWHFIAFLEATEDGDGVF 
LARFVHQHLLEAALQRGILLDVLAILVEGSSTDAVQLAARQSRLEHVAGVHGTFRLAGADHGVQFVDEQDDPAFLLAQFV 
EDRLQAFLELAAELGTGDQRPHVQGQQALVLEAVRHFAVDDALGQALDDGGLADAGFADQHRWLGPPLQDLDGPADLW 
ATDHRVELAFLGALGHVDGVLVQRLARLLDVRWHRFAATQVGHGILQRLARHALAEQQLAEPGVLVHRGQQYQLAGDEL 
VALLLGQAVSLVEQACEILGQVHVAGRALDLRQRVEFFVEAAAQGGDIEADLHQQGLDRTALLLEQGGKQVHRLDGRMVM 
ANGQGLGVGERQLQLAGQTVYSHGSSFLL* 

>ORF23367 (SEQ ID NO:275) 

AIAERLSSNPGRFRCCWIRVSEARHFTDARGHVNPLIQKVFLLFPRATKRPFHCMRQRSRAYRTPMKKPRRSGAFPVRPR 
RGQAKTISSPSTFAEILAPGANLPARISCASGFSIQRWMARFSGRAP* 

>ORF25103c (SEQ ID NO-.277) 

SAPRGEHHRRRDHRRGQAVAPLHHRSATAGQGHRPDRRGRQPHPHGDRLQAGGTGSSRPSPDPAEDRARGAEEGRRRSHQ 
EAPGQAGGGYRQARARIRRPRGDLEVREGRGAGLGADPAEDRAGQAGDGGGAAQGRPREHGAHPVPDHPGPGTQPADGRP 
ARQDREPVAAQQGDRRGNRRSGFQVDRYPGVEDARGRAREAAAHGAGAASASDRPGRGGSRRVQRRAPFARRPRRSEPAE 
RLVPLPRPDRGGQDRWQGAGRVPLRYRGGAGADRYVRVHGETLGGPPDRRASGLRRLRGRRLPDRGDPPQALLGGAAGR 
GGEGPSGCIQHSPPGARGRTPDRQSRAYGGLPQHRGGDDLQPRFGADPGAGRRPRGATCRSDGRGQCALPSGIHQPDRRS 
GGVRAAGSRADRRHRRDPARSPAQAPGRARAEPGTEPGGAGQADCRRLRPGLWRTPAEAGHPALDREPAGATDPGRQIRA 
GCQYLGEGGRRRDRLRLTSSGADRESPASAGLFHGRPVGARALPHAVEGPFCRSWKKKKNFLDQRVDMPARIRKMARL* 

>ORF23556 (SEQ ID NO:279) 

KSPAEAGLSLSAPEEVRRRRSRRLPPSPRYWHPARICRPGSVAPAGSRSSAGWPASAGVRHRPGRSRRQSACPAPPGSVP 
GSARARPGACAGDRAGSRRCRRSAREPAARTPPLRRSG* 

>ORF26191c (SEQ ID NO:281) 

KEGRPMRIDRLTSKLQLALSDAQSLAVGHDHPAIEPVHLLSALLEQQGGSIKPLLMQVGFDIAALRSGLNKELDALPKIQ 
SPTGDVNLSQDLARLLNQADRLAQQKGDQFISSELVLLAAMDENTRLGKLLLGQGVSRKALENAVANLRGGEAVNDPNVE 
ESRQALDKYTVDMTKRAEEGKLDPVIGRDDEIRRTIQVLQRRTKMPVLIGEPGVGKTAIVEGLAQRIINGEVPDGLKDK 
RLLALDMGALIAGAKFRGEFEERLKAVLNELGKQEGRVILFIDELHTMVGAGKAEGAMDAGNMLRPALARGELHCVGATT 
LDEYRQYIEKDAALERRFQKVLVDEPSEEDTIAILRGLKERYEVHHGVSITDGAIIAAAKLSHRYITDRQLPDKAIDLID 
EAASRIRMEIDSKPEELDRLDRRLIQLKIEREALKKEDDEATRKRLAKLEEDIVKLEREYADLEEIWKSEKAEVQGSAQI 
QQKIEQAKQEMEAARRKGDLESMARIQYQTIPDLERSLQMVDQHGKTENQLLRHKVTDEEIAEVVSKWTGIPVSKMLEGE 
REKLLRMEQELHRRVIGQDEAWAVSNAVRRSRAGLADPNRPSGSFLFLGPTGVGKTELCKALAEFLFDTEEALVRIDMS 
EFMEKHSVARLIGAPPGYVGFEEGGYLTEAIRRKPYSWLLDEVEKAHPDVFNILLQVLEDGRLTDSHGRTVDFRNTVVV 
MTSNLGSAQIQELAGDREAQRAAVMDAVNAHFRPEFINRIDEVWFEPLAREQIAGIAEIQLGRLRKRLAERELSLELSQ 
EALDKL IAVGFDPVYGARPLKRAIQRWIENPLAQLI LAGKFAPGAS I SAKVEGDEIVFA* 
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>ORF23751 (SEQ ID NO:283) 

TGSKPTAISLSSASWLSSRLSSRSARRLRRRPSWISAMPAICSRASGSNTTTSSIRLMNSGRKCALTASITAARCASRSP 
ASSWICAEPRLEVITTTVLRKSTVRP* 

>ORF24222 (SEQ ID NO:285) 

P66APIRRATECPSMNSDISIRTSASSVSKRNSASALHNSVLPTPVGPRKRNEPLGRFGSARPARERRTALDTATTASSW 
PITRRCSSCSMRSSFSRSPSSIFDTGIPmETTSAISSSVTLLRSMFSVLPCWSTICRLRSRSGMVWYMRAMLSRSP 
LRRAASISCLACSIFCWICAEPCTSAFSDFQISSRSAYSRSSLTISSSSLARRFLVASSSSFFSASRSIFSWIRRRSRRS 
SSSGLESISMRMRLAASSIRSMALSGSCRSVM* 

>ORF24368 (SEQ ID NO:287) 

TRTYLSAPAPPRYRRGTRPAPCTTRSCPPRSGRGRGTSRSAGSDRRGRRANGARRWTRRLPPRPGRSLADAAPAPCAAAS 
RARPRASSTPGYRSTWKPLRRFPRRSPCCAATGSRSCRAGRPSAGCVPGPGWSGTGCAPCSRGRPCAAPPPSPAWPARSS 
AGSAPSPAPRPSRTSRSPRGRRIRARA* 

>ORF24888c (SEQ ID NO:289) 

RRKTTKPPGSAWPSWRRISSSSSANTPTSRRSGSPRRPRCRARRRSSRRSSRPSRRWRRRGARATSRAWRASSTRPSRTW 
NAACRWSTSTARPRTSCCATR* 

>ORF25398c (SEQ ID NO:291) 

RRSSTNWASRKAGSSCSSTNCTPWSAPARRKVPWTPATCSSRLWRAASCTASVLLPSTSIASTSRRMPRWSAASRRCWWT 
NRARKTPSPSSVASRNAMKCTTG* 

>ORF25892c (SEQ ID NO:293) 

PPGPAEGRPVHLQRAGIAGRDGREHQARQAAARPGRVAQGAGECRGQPAWRRSGERPERRGVAPGAGQVHRRHDQARRGR 
QARPGDRSRRRDPPDHPGPAAADQEQPGADRRTRRRQDRHRRGPGPAHHQRRSAGRPQGQAPAGPGHGGADRRCQVPRRV 
RGTPEGGPQRTGQAGRPGHPVHRRTAHHGRRRQGGRCHGRRQHAQAGSGARRAALRRCYYPRRVSPVHREGCRAGAPLPE 
GAGGRTERGRHHRHPPWPQGTL* 

>ORF25110 (SEQ ID NO:295) 

RSLRPRRMAMVSSSLGSSTSTFWKRRSSAASFSMYWRYSSRVVAPTQCSSPRARAGLSMLPASMAPSALPAPTMVCSSSM 
NRMTRPSCLPSSLRTAFRRSSNSPRNLAPAISAPMSRASRRLSLRPSGTSPLMMRWARPSTMAVLPTPGSPISTGLFLVR 
RCRTWMVRRISSSRPITGSSLPSSARLVMSTVYLSSAWRDSSTFGSFTASPPRRLATAFSSALRDTPWPSSSLPSLVFSS 
IAASNTSSLEMNWSPFCWARRSAWLSRRARSWDRFTSPVGLWIFGSASSSLLRPLRRAAISKPTCISRGLIEPPCCSSRA 
ESRCTGSMAGWSWPTARDWASESASCSLLVKRSIRMGRPSFYRAGRNDGCP* 

>ORF25510 (SEQ ID NO:297) 

CAGPGPRRWRSCRRRVRRSAPGCSWSAAAGPGWSGGSRRRDRSPGRACLPRRAWSCRRCTCPAPGATPRRSGRSPLRRHA 
GWPRHSPAPCATRPGRAAACRAWCSRPSRPAIPARWR* 

>ORF26762c (SEQ ID NO:299) 

PPTACRRCSATARAPGWPRPMPAGAGWRRACMRRWTAWACPATNCWSGWGRRSARRPSRSAARSAMHSSLRTPRRARLS 
YLAPIRAASWPTSTDSRGSAWAPMASPPCMAAASAPSAIPRASIPTAARRVPAVLPAWSGSRTRPAQVIRRQLTDVTVRS 
LEPRKIALIY* 

>ORF26257 (SEQ ID N0:301) 

IRAIFRGSSDRTVTSVSCRRITCAGLVLEPDQAGKTAGTRRAAVGIEARGIAEGAEAAAMHGGDAMGAQADPRESVDVGH 
EAARIGARYESRARLGVRSDECIADLAADLEGLRADRRPQPDQQFVAGHAQAVHRRLQHARRQPAPAGMGRGHPGARAVA 
EQRRQAVGGHDRTGDARHRAPAGVGPEHRFGSASTTSLRAPIPTSSAGIPDARSGVGGFIPRRADRRRRGRQGSDCRRVP 
G* 
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>ORF26844c (SEQ ID NO:303) 

RGGGRPEPVLRADASWSJ^GVACTIMTADCLPALFCDRSGTRVAAAHAGWRGLAAGVLEATVDSLGVPGDELLVWLGPA 

IGPQAFEVGGEVMAFVAAHAEARSAFVPSANPGRFMADiyRLARIRLGAHGVTAVHGGGFCTFSDTARFYSYRRSSRTG 
RFASLVWLQD* 

>ORF26486 (SEQ ID NO:305) 

MSAMKRPGLALGTKAERASACAATNASRTSPPTSKACGPIAGPSQTSSSSPGTPRLSTVASSTPAASPRQPAWAAATRVP 
ERSQNNAGRQSAVMIVQATPGIALQLASALSTGSGRPPPRHSVHLFQPARPAFQTLGQASAVLFHGARIWDVGMVQTV 
EGCLADPATARGHAGPHTGRRRPVGGQPGVQPTNASRSWRNRLSSQRNSSGSGDSHFMRTPVAG* 

>ORF26857c (SEQ ID NO:307) 

VHGVTWWRPTRTGAQGRRQLERDAGRRLYDHDRRLPAGVVLRPLGHPGGRGPCRLARAGGGRAGGDGGQPGRARRRTAGL 
AGAGDRPAGLRGRRRGPRCIRRCARRGALGFRT* 

>ORF27314c (SEQ ID NO:309) 

SGNRRCRKNSSGCSACCARIARRSSVERLADPRLAGAGPCAGLRDHAQWRGQPGTLRQSEPWRPRLRRSARRGIKPPTPD 
RASGMPAELVGIGARSDWEADPNRCSGPTPAGARCRASPVRS* 

>ORF27730c (SEQ ID N0:311) 

QARRPGGPSGCRPSGRHPAECLALPCPGHRQCAARRDRPPPGQGHDRPDGSGQDAGGPHQAGGATAGTVGQPHLRGDRDR 
RDHLRRHHRCADRTAWRAAAEDGGGRRRQGGGQPLPRAGTLPCAHPYPGQAGDRAYPPDPRAHEPYWLSPGRRSGLRWAL 
QDSPGGQPDPGPDSSRIPPAGAARALPRTGSPGHRRAHEVGIAAAGRIPLAAQPVAPGSRGVRRLNAWLTPDWPAPARVR 
ACVTTRSGGVSQAPFDSLNLGAHVYDDPRAVE* 

>ORF26983 (SEQ ID NO:313) 

PRHCAWSRRPAHGPAPASRGSARRSTDERLAILAQQAEQPEEFFRQRRFPLHAHAGGRVIQFEEARVQRLPGEFAKSLDQ 
GLAGHRGNPEAPTmiADQGIAmBTHADLVGTPGLQLDPGM 

IDGAAGGDHADHDRLVDAADRPCLQLRHQLGVGLQRLGHYHQAGRVLVQAVDDPGARHIGDVRDMVEQGIQ^GAVmGS 
RMDHQAGGLVNHQDVLVLVDDFQLDVLCEPLALGFLLGLQDQLRAAVDDVARAQHGAVDGQATVLDPAGQTGAGVFGKKL 
GGDLVETLATQLERHLGRALNHIGHE* 

>ORF28068c (SEQ ID NO:315) 

PQRVADSKSRAEHRLLLMSDMIQRAAEVPFELGGQRLDQIAAQLFPEHSRSRLAGWIKDGRLTVDGAVLRPRDIVHSGAQ 
LVLEAEQEAQGEWLAQDIELEIVYEDEHILVIDKPAGLWHPAAGHQDGTLLNALLYHVPDIANVPRAGIVHRLDKDTTG 
LMWAKTLEAHTKLVAQLQARSVSRIYEAIVIGVITSGGTIDAPIGRHGVQRQKMAWDAGKVAVSHYRVLERFRAHTHT 
RVKLETGRTHQIRVHMSHIGYPLVGDPVYGGRFRIPPVASQTLVQTLREFPRQALHARFLELDHPATGVRMKWESPLPEE 
FLWLLSLLRQDREAFVG* 

>ORF27522 (SEQ ID NO:317) 

PTVPAVAPPAWCGPPASWPLPSGRSCPCPGGGRSRRAAHWRCPGHGRARHSAGCRPDGRQPDGPPGRRACQSPGCARPRR 
RFPARCPVRATRPGLPARPPGPVARRCGRCRAGAARRRRRSGDRP* 
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>ORF28033c (SEQID NO:319) 

ASSPTHVRYDSTR6RGAVRAGWPASRPDRRPAFSRTLPLPS6RLDQGRSPDRSRRRAAPARHRPQRRATGP6GRAGSPGR 
VARTGHRAGNRLRGRAHPGD* 

>ORF29701c (SEQ ID NO:321) 

SSSSLEISRTSTRPl^RYRMRRRMRCPCSSLSRSRSARWALVLAQVRLAAIPALFVGEGVGLRHVDAAMGAADHRRC 
ARLVLRTLLLARSGTGKATPEPERDGDQGDPEQEAEKAHGDLGGWRKLQFSQAAGSIPDGKVQAVRRLALGEAWRRAKRR 
EACASLRCFVRSVEETEGHVAPPGATGVLWIALRLLWGAVILVFRLQFGGDLPLGILVLLDHVLGGLGFHVRRRLAAF 
DQAQGGLGQPGAGVGLAFAGDELAILEAGVIRIVQLEGFQAGAGQWETQATVGFDHDRQAIADGRGFLEVLHHVATAVG 
GGDIGLALQVWADVHFVGRQQVAQVHHARLGVRGVAAVGEAAGELGELVEGVAGGARVALGHVQRQEARQQAAVLVEGG 
QAFEWGVVDVGVLRMQADEAFGGGAGGFGLHVLVVGVDQLELGLLGVAAEGIARFEGFQLGDGAVVALVVEVVLRLLVQ 
LALAQVLVDSLLVRGAGCGEGEDGDQQQVFHLHGGLRPWDGRLGLNRLL* 

>ORF28118 (SEQ ID NO:323) 

QTVEMPWPGTQAAMQVIOILLLIAILALTAACSSMETVDmSESQLYQQAQDDLMKSYNSAVTKLmESRYPFGR 
YAEQAQLELIYANYKNMEPEAARAAAERFIRLHPQHPNVDYAYYLKGLSSFDQDRGLLARFLPLDMTKRDPGAARDSFNE 
FAQLTSRFPNSRYAPDAKARMVYLRNLLAAYEVHVGHYYLKRQAYVAAANRGRYVVENFQETPAVGDGLAIMVEAYRRLG 
LDDLASTSLETLKLNYPDNASLKDGEFVARESEADTRSWLAKATLGLIEGGEPPPHMETQAAKDVIKQYEDAEREIPAEL 
KPENQDHSADDEKPESDDDEDSGRSWWSYMTFGLFD* 

>ORF28129 (SEQ ID NO:325) 

GQADRPRDASRHASETPAADRHPRPHRSLLLEQGDCRREPEREPAVPAGAGRPQQQELQQRRHQAESPRIALSLRPLRRA 
GPARADLRQLQEHGARSRPRRRRTLHPPASAAPQRRLRLLPQRPVLLRPGPRPAGALPAAGHDQARPGRRPRLLQRVRPA 
HQPLPQQPLRPGRQGAHGVPAQPAGGLRSARRPLLPEAPGLCRRRQPRSLRGGELPGNPGRRRWPGDHGRSLPSPGSRRP 
GQHQPGNPQAELSG* 

>ORF29709c (SEQ ID NO:327) 

GPDLPVRWRSAGPVPGRWSGGTGGTDGGCVAPAPRSAVAVQRARSPWSWRRCAWLQYQRCSWARALACGTWTPQWAQRTI 
GDALGSSCGRCCWRGVGRVKRRQSQNAMAIRAIQNRRRKRPMVISEAGESCSLAKPPARSQTGRSRLCGVWRWERHGGGQ 
KEGRPAPPFGVSCDQSKRPKVM* 

>ORF29189 (SEQ ID NO:329) 

SHETPKGGAGLPSFCPPPCLSQRQTPHSLDLPVWDRAGGLAKLQLSPASEITMGLFRLLFWIALIAIAFWLWRRFTRPTP 
RQQQRPQDEPSASPMVRCAHCGVHVPQANALAHEQRWYCSQAHLRQDQGDRAR* 

>ORF29382 (SEQ ID NO:331) 

SPSRSGSGVALPVPLRASSNVRRTSRAHRRWSAAPIAASTCRRPTPSPTNNAGIAARRTCARTRATVRAERLRLSEEQGQ 
RILRLYHLYRLTIGLVLVLLISSELEDQVLKLVHPELFHVGSWCYLVFNILVALFLPPSRQLLPIFILALTDVLMLCGLF 
YAGGGVPSGIGSLLWAVAIANILLRGRIGLVIAAAASLGLLYLTFFLSLSSPDATNHYVQAGGLGTLCFAAALVIQALV 
RRQEQTETLAEERAETVAN1EELNALILQRMRTGILWDSRQAILLMQAALGLLRQDDVQGASLGRHSPMLHHCMKQWR 
LNPSLRPPTLKWPDGPTVQPSFISLNREDDQHVLIFLEDISQIAQQAQQMKLAGLGRLTAGIAHEIRNPLGAISHAAQL 
LQESEELDAPDRRLTQIIQDQSKRMNLVIENVLQLSRRRQAEPQQLDLKEWLQRFVDEYPGRLRNDSQLHLQLGAGDIQT 
RMDPHQLNQVLSNLVQNGLRYSAQAHGRGQVWLSLARDPESDLPVLEVIDDGPGVPADKLNNLFEPFFTTESKGTGLGLY 
LSRELCESNQAEIDYRNREEGGGCFRITFAHPRKLS* 

>ORF30590c (SEQ ID NO:333) 

LLQQLGGVADRAQRVADLMGDAGGQAAKTGQLHLLRLLGDLRNVFEEDQHVLVVFAVEADKAGLHRRAIRHHLERRRTEA 
GIQAPLLHAVHQHRAVAAEAGALHWLPEQAEGGLVGEEDGLTAIDHEDAGAHALQDQCVEFLQVGDRLGAFFGQRFGLL 
LAPHQSLDHQRGGEAQGAEAAGLDVWGGVRTAQAEEEGQVEQAEAGRRRDDQADAPAQQDVGNGHRHHQQAADAAGYAA 
TCVEQAAKHQHVGEREDEDRQQLPRRRQEQRDQDVEDQVAPTADMEQFRVDELEDLIFQFAGDQQDQYQADGQAVQWQT 
EDALPLLLAQP* 
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>ORF29729 (SEQ ID NO:335) 

TVPCRQLVLPGLQHPGRAVPAAVAAIAADLHPRAHRRADALRPVLRRWRRTQRHRQPA6666GHCQHPAARAHRP6HR6G 
GQPRPALPDLLPQPEQSGRHQPLRPGRRPRHPVLRRRAGDPGSGAAPGADRNAGRRTRRDGRQPGGTQRIDPAAHAHRHP 
RGR* 

>ORF30221 (SEQ ID NO:337) 

PSGHPPRQPGRPRPAQAGRRAGRQPRPPQPDADALHEAMAPESQPPSADAQGGAGWPDGATQLYQPQPRRRPARADLPRR 
HFADRPAGAADEAGRSWPPDRRHRP* 

>ORF30736c (SEQ ID NO:339) 

SHSFRSSCCGSAWRRRESCRTFSMTRFIRFDWSWMICVRRRSGASSSSDSCSSWAAWLIAPSGLRISWAMPAVRRPRPAS 
FICCACWAICEMSSRKISTCWSSSRLRLIKLGCTVGPSGTTLSVGGRRLGFRRHCFMQCISIGLWRPRLAPCTSSCLSRP 
RAAWLARRMA* 

>ORF30539 (SEQ ID NO:341) 

DPQPAGRDQPRRPTAAGVRGTGCPGPTPDADHPGPVEADEPGHRERPAALPSPPGRTAAARPEGVASAVRRRIPRQAAQR 
QPTAPAARCRRHPDPHGPTPVEPGAEQPGAERSSLQRPGARARPGLAEPRARPGERPAGAGSHRRRSRRTGGQTEQPVRT 
LLYYRKQRHRPGPLSLPRTLREQPGTDRLPQSRGRRRLLPHHLRPPAQTQLTEAARMSRQKALIVDDEPDIRELLEITLG 
RMKLDTRSARNVKEAASCWPASRSTCASPTCACRTAAASIWSSTSSSAIHRPRWP* 

>ORF31247c (SEQ ID NO:343) 

FPAVRGYPVHRRRSGLFVGSCVRLPSAEFARVGEGDAEAAAAFLAIAWDPCLVALAEFAGEIEAQAGAFAFCSKEGFEQ 
WQFVRRYAGTVVDDFQHRQVALRVAREAQPDLAAPVRLGAVMTVLHQVAQHLVQLVWVMGLDVAGTELQVQLAVVAQ 
PAGVFVDEPLKPLLQVELLRFGLAATGELQDVLDDQVHPLRLVLDDLRQASVRGIQFL* 

>ORF30963c (SEQ ID NO:345) 

LPAPAGRSPGRARGSARPGRARAPGRCSEDRSAPGCSAPGSTGVGPCGSGCRRHRAAGAVGCRCAACRGIRRRTAEATPS 
GRAAAVRPGGDGRAAGRSR* 

>ORF31539c (SEQ ID NO:347) 

GGCHQLPQATEVDRFGEEVEGTGLERLDRGVQAAVRGDHGHRGLWMALLDVLDQIEAAAVRQAHVGEAQVERLAGQQLAA 
SLTLRALRVSSFMRPRVISSSSRISGSSSTIRAFCRLMRAASVS* 

>ORF31222 (SEQ ID NO:349) 

TGYPRTAGNHSRPHEAGHPQRPQRQGSRELLAREPFDLCLTDMRLPDGSGLDLVQYIQQRHPQTPVAMITAYGSLDTAIQ 
ALKAGAFDFLTKPVDLGRLRELVATALRLRNPEAEEAPVDNRLLGESPPMRALRNQIGKLARSQAPVYISGESGSGKELV 
ARLIHEQGPRIERPFVPVNCGAIPSELMESEFFGHKKGSFTGAIEDKQGLFQAASGGTLFLDEVADLPMAMQVKLLRAIQ 
EKAVRAVGGQQEVAVARAHPLRHPQGPRRRSRRRALPPGPLLPPQRHRAARTPLRERREDIPLLAERILKRLAGDTGLPA 
ARLTGDAQEKLKNYRFPGNVRELENMLERAYTLCEDDQIQPHDLRLADAPGASQEGAASLSEIDNLEDYLEDIERKLIMQ 
ALEETRWNRTAAAQRLGLTFRSHRYRLKKLGID* 

>ORF31266 (SEQ ID NO:351) 

SWTPAAPATSRKPRVAGPRAVRPVPHRHAPAGRQRPRSGPVHPAAPSTDPGGHDHRVRQPGHRDPGAQGRCLRLPHQTGR 
PRSLAGAGGNRPTLAQPGSRGSAGGQPPARRVAADARPAQPDRQAGAQPGAGLHQWRVRQRQGTGGAPDPRAGATYRAAV 
RAGELRRDSLRADGKRVLRPQERQLHWRYRRQAGPVPGRQRWHPVPRRSRRPADGHAGQTAPGDPGKGRARGRRPAGGRR 
RTCASSAPPTRTSPPKSAPGASARTSTTASTSSSCAYTAARTPRGHPAARRTHPQAPGRRHRPAGRQADRRRTGEAEELP 
LPGQRPRAGKHAGARLYPVRRRPDPASRPAPGRCAGCQPGRRREPERNRQPRGLPGRHRAQADHAGTRGDPLEPHRRGPA 
PGPDVPLDALPPEKAGHRLKVKRPVRRQAFWFSLLRGDQPGRRGPGR* 
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>ORF31661c (SEQ ID NO:353) 

TGAWLRASLPIWLRRARIGGDSPSRRLSTGASSAS6LRKRRAVATSSRKRPRSTGLVRKSKAPALSAWIAVSRLPYAVIM 
ATGVCGWRCWMYWTRSRPLPSGRRMSVRHRSNGSRASNSRLP* 

>ORF32061c (SEQ ID NO:355) 

RSWRKRPAPTSAARSLWVAQRMRTCDGDLLLAADRAHGLFLDRPEQFDLHGHRQVGDFVEEQGATAGGLEQALLVFDSAS 
EAAFLVAEELAFHQLGGNRMVHRHERPLDTWPLLVDQARHQFLAAAGLATDVDRRLAARQLADLVAQGAHRRRLAEQAV 
VHRRFLGFRVAQA* 

>ORF32072c (SEQ ID NO:357) 

GGSRGPGGSARRRLRRRGPCGWRRGCARATATSCWPPTARTAFSWIARSSLTCMAIGRSATSSRNRVPPLAAWNRPCLSS 
IAPVKLPFLWPKNSLSISSEGIAPQFTGTNGRSIRGPCSWIRRATSSLPLPDSPLM* 

>ORF31784 (SEQ ID NO:359) 

WKASSSATRKAASLALSKTSRACSRPPAVAPCSSTKSPTCRWPCRSNCSGRSRKRPCARSAASRRSPSHVRILCATHKDL 
AAEVGAGRFRQDLYYRLNVIELRVHRCANAARTSRCSPNASSSAWPATPACRPPG* 

>ORF32568c (SEQ ID NO:361) 

GAKTKRPVFGQAFSLSVDAQLFQAVAHRAERQAQALGRGGAVPAGLLECLHDQLALDVFQWLEWDFAQARGAFLAGTR 
RIGQAQmLDLVVFAQGIGALQHVFQIADVAREAVVLQLLLCVAGQPGGRQAGVAGQALEDAFGEQRDVLAAFAQRCTR 
SSMTLRR* 

>ORF33157c (SEQ ID NO:363) 

TDGGARLVARRRSSGRRGDLAWRDPWRQGAAGGRRLERRWEAAWPGTARGTGERSDDPLQVRGGFPAAHGAGQGALRDS 
AARRPHPDRQHLGTFGLRQDADRRGAGKPQGVCGRTVAGTGGHAAGGPLGRVAPGLSRRHPLYRSGAWLRRALAEYRALP 
QRAGPGTGWPSAGGSHERAGTDHRPGPLRPGWSPLRSENQKACLRTGLFTFSRCPAFSGGSASSGTSGPGAGPRRCGSS 
GSPRVPA* 

>ORF32530 (SEQ ID NO:365) 

KGLSEDRPFGFRSSEATSRGVGGRVDDRFPPAHEIRQQTARRRCQDQPVAWPGIQPEPVEARHRTDIGDAFGRARAQPC 
PVGHRLHVRQFRQQFCRRRPEAFQRLVGRRLVEARMFQGAADQDVAVAPRNRVAPLGQHHARQEIRRALVEDHLTFHRYH 
GQFQAKRLQQLAAPGACRQQHLVATDLATRGRHADHSIAVAQPAAHLRLFMQLEIGELLQGCPQ* 

>ORF33705c (SEQ ID NO:367) 

VIFLCSWQIGRSPWSRDWWGAGVIGLLTAREIALAGLRVTLVERGESGREASWAGGGIVSPLYPWRYSPAVTALAHW 
SQDFYPALGQRLLDETGLDPEVHTVGLYWLDLDDQTEALQWARHHTRPLREVPIEEAYAAVPGLGAGFQRAVYMSGVANV 
RNPRLARSLRASLQQFANLELHEQTEVRGWLRDGDRWGVATSRGEIRGDKVLLAAGAWSGELLKPLGLELPWPVKGQM 
ILYRCAADFLPRMVLAKGRYAIPRRDGHILIGSTLEHSGFDKTPTDEALESLRASAAELLPELADMQPVAHWAGLRPGSP 
EGIPYIGPVPGFDGLWLNTGHYRNGLVLAPASCRLLADLMSGREPIIDPAPYAPAGRL* 

>ORF32832 (SEQ ID NO:369) 

GFPAPRRSASCRSPNVPRCCRSGCGRRAAESRSAPWPAPCAAGNPPRTCRGSSDLSPVPRAVPGQAASTTRRSRRLPPAA 
PCRHGSRHARSPRRPLDRRRATSRAPPSVHATRDWRIVAGMPAMSAPGEDCAHWPRPTCRPPAGSLRPARAPPRRPPRSA 
LPSTAGCGCVPTAVPRSGRPGPASTGQRYGPRDRARSRRANAAPGPGRSPATSAPGRSPPGCSATDRAARRSRLPPRMPH 
AHSRPAPPGSPAVRRAPAPGRSTGR* 
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>ORF33547c (SEQ ID NO:371) 

GILGGRRDRLAALSVALQP66DRP6ALVAGLLP6P6AAFARRDRARSRGPYRWPVLA6PGRPDRGTAV6TQPHPAVEGSA 
DRGGLRGGARAGRRLPAGGLHVGRGQCAQSSPGALIAGIPATIRQSRVA* 

>ORF33205 (SEQ ID NO:373) 

ARQARIAHIGHARHVDRPLEACAQPGHRRVGLLDRHFLQRPGWACPLQCLGLWQVQPVQANGMDLGIEPGLVEQTLPQ 
GRVEVLRPVRQGGHRRAVAPRIERRDDPASRPGCLTPTLAPLHQGHPQSGERQLPGGQQADDASAYHYYISTHHRAPTDL 
PGTEKYHSKGSDADELPASTNSVESSPGEKPIIPAEVFIP* 

>ORF33512 (SEQ ID NO:375) 

SGETIPPPAQDASRPLSPRSTRVTRSPASASSRAVNRPMTPAPTTTTSLLTTGLLPICQEQRNITQRDQMLTNCLLQRTQ 
SNLVPVKSPSYPQRYSSHEIEWFEFGGTIDSPIDPCDRRDNCAAHPPRQNEAGH* 

>ORF33771 (SEQ ID NO:377) 

KAHHTRRGIHPMSSGLNLVELSIVLSILAIGVTIALPTLPDRMKRDISRDIGDSLTSHVMAARASSIQNGVIIEVCGSG 
DGSTCSEEWHLGWFSRNDRSQQILARHENTSRTDIHWRGFDKRLRYLPNGTSPTGNGRFFECKDDRIEWQLVLNRQGRLR 
VAGKSENKKLSYLCSRR* 

>ORF34385c (SEQ ID NO:379) 

WRAHCPCSRDSRSGEWDDWQTVCETVLSPPGAQVRELFIFALSRHPEAALPIEHQLPLDAIVLTFEETPVTCRAGAIRQV 
AQSLVEAPPMIGATRIFMPGQYLLAPVVTAEPAEMPFLAAGTAVTATAHLDDHAVLYAASPRSHHMTSQAVTNITAHVP 
LHSVGEGGQRNCHAYRKDR* 

>ORF33988 (SEQ ID NO:381) 

SSRCAVAVTAVPAARNGISAGSAVTTGANRYWPGMKIRVAPIFIGGASTSDCATCLMAPALQVTGVSSNVRTIASSGNWC 
SIGKAASGWRERAKIKSSLTCAPGGERTVSHTVCQSSHSPLRLSLLQGQCALH* 

>ORF34274 (SEQ ID NO:383) 

KALLPVLQAVRELFHIPFASHPTLRSGCLCYRDNALSTRQDYLALFLVEYCMRSICRSAGFSLIELMMVLVLVAIFASIA 
VPSFNALIERNRIQTASEELYSLLQYARSEAVNRHANVSIRATQNNDWAKGLEIISGATTVQKHQGFQQVSLSASSATAE 
LTFNATGTLSNQAANIDIKVCFAGDKSTGRLLTVQPSGRVILYPSSKQPDSCN* 

>ORF34726c (SEQ ID NO:385) 

RDLLETLVLLHGGRAADDFQAFCPVIVLRRPDAHIGMTVYSFAASVLKQAVEFLAGSLDSVALDQGVETGYGNAGEYGDQ 
NQHHHQLDQGKAGAATNRAHAVLHKEKGQIILPSGERIVPVAETAGAESGMTGKRYVKQFSHRLEHR* 

>ORF34916 (SEQ ID NO:387) 

GKPMSRETGFSMIEVLVALVLISIGVLGMVAMQGRTIQYTQESVQRNAAAMLASDLMEIMRADPDAVLNLRAQLREDSVY 
YKAKGSDFPAAPARCAPLPADAKERLGCWAQQASKDLPGASALLNSQFYICRSPTPGTCDNTKGSAIEIQVAWRAMDGAC 
FNASDSTLCTYSVRSEL* 

>ORF35464c (SEQ ID NO:389) 

RACLFSQFGADAVGAQGGVRGVETRSIHGSPGNLDFDGRAFGWAGTRGWAATNVELAIQECGGSRQVFRGLLGPAAETF 
LSICWQWRAARWGCGKVAALGLWDRVFA* 

>ORF35289 (SEQ ID NO:391) 

IANSTFVAAQPRVPATTPKARPSKSRLPGEPWMERVSTPLTPPCAPTASAPNCENKHALQQNAERPIDGRTARGTRYKQL 
PDPGDQPDLHRQQTQLSFPARPGRQPGK* 
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>ORF35410 (SEQ ID NO:393) 

LHLVHLQRPLRIVRTSMLFSKMQKGLSMVELLVALAISSFLILGISQIYIDNKRNYLFQQGQA6NQENSRFVLMLLQQQL 
DKTAYRRLHDDNMENAFKSATFNGCRAFVAGETIAAATALKAGEYGVCLRYQPAYKGEHDCLGNEITGVPEKPFTNTPPV 
WRLVYLPSAGTLSCSRPDIAQSKSGELVSGLTDFRLEAGVGPADRSERKVSSFVALQDVAGRPIRALRFSILAGSDNTS 
LRTGDDSQARDRWIVLYPESKSAIEAADKGQIYQIARGNQTIRNLMP* 

>ORF35907c (SEQ ID NO:395) 

VDQANDDRGSICEGLFRNSGNFITFAIMLPFVGGLIAQADTV^ 

TIGCLIQLLLQQHKNEAAIFLVAGLALLEKIVAFWDVDLADPQDQEAAYSECHEQFYHR* 

>ORF35534 (SEQ ID NO:397) 

SWGSARSTSTTNATIFSSKARPATRKIAASFLCCCSNMIRQPIVAFTTTTWRMLSNPRHSMAVVHLWLARLSLRQLPSR 
RVSTVSACAINPPTKGSMIASVMKLPEFRKSPSQILPLSSFAWSTYRAPVP* 

>ORF35930 (SEQ ID NO:399) 

VAWPISPSRNRENWSWSQTSAWKRGSGQQIVANAKYPASSHYRMSPWLSEHCASQSWQAATIQACAQEMIARHAIAG 
SSFIPRAKAPSRPQTKARFTK* 

>ORF36246 (SEQ ID N0:401) 

PNHQESHAMTLRHTSRQQGSTLLISLVILLMITLLAVSNMREVSLESRITGNLIEQKRLRNAGEAGLREGERRFFNTIKP 
PEVGSGCADSNVKRPCILNLSALSVPRDDVHl^VAALNGKTDNMSRVWMPYRGSDLNNPTQIDKDRAVTWQTITVPAG 
EQNNEAENPEYGNMMRGVGTFYYETNSRALNKAGGETVLQAVHARLYTN* 

>ORF26640c (SEQ ID NO:403) 

GIQTRELALSVLPFRAATGLLCTSSRGTERALRFSMQGRLTLLSAHPLPTSGGLMVLKKRRSPSRSPASPALRRRFCSMR 
LPVIRLSSDTSRMLETARSVIINKITSEINKVDPCCREVWRRVMA* 

>ORF36769 (SEQ ID NO:405) 

CAGSARSTTKPTAAPSTRRAERLFYRPFMHACIPTDWSQRMIHQITRAGKSLLAAGCTLSILFASDSYAATALNVSQQPL 
FLTQGVAPNLLFTLDDSGSMAWAYVPDGISGNSGRAGRSSDYNALYYNPDYAYQVPKKLTLSGDQIIVSDYPVPRFTAAW 
QDGYAQGSTTNLSNNYRPQWGTGWLGCIDSSCNTGRAYYYTYKVSASCPAQPVSSSNSCYTYNALPTSQESNFAIWYSYY 
RNRILATKTAMLAFYSLPE1WRLTWGALNTCSIGANSRSCQNNALLQFNKQHKINFFNWLANSPASGGTPLHAALDRAG 
RFLQTNGTAYTTEDGKTYSCRASYHIMMTDGIWNGRNVTPGNLDNQNQTFPDSTLYRPQPPYADSNASSLADLAFKYWTT 
DLRPSIDNDLKPFMAYKSGDDSKDYWDPRMPATWQHMVNFWGLGLSYSLTLNSAPTWTGSTFGNYEELMAGSKAWPSV 
DNDAAPGNVYDLWHAAINSRGDFFSAESPDSLVQAFNKILTRISERNTSSSKPAMTSALQDDGTGDKLIRYSYQSSFASD 
KNWAGDLIRYKVESTSTGSTKTQEWSAGALLDNRAPATRNIYIASNSGTNRLKPFTWSNIEGSQLATWLNRNPDKDNQAD 
TKGAQRVDFIRGQQNMDGFRQRQAVLGDIVHSSPAWGPAQYLTYLANPIEPSGDYGTFKTEADQRSPRVYVGSNDGMLH 
GFNIKTGVEEFAFIPTAVFEKIMLTGISYQGGMQYFVDATPWSDAFFDGAWHTVLIGTLGAGGRGLFALDVTKPDDV 
KLLWEYDSSTDSDLGYTFSKPTVARLHSGQWAVVTGNGYGSDNDKAALLLIDLKKGTLIKKLEVQSERGIANGLSTPRLA 
DNNSDGIADYAYAGDLQGNIWRFDLIGNTRNDDPDTNTSINPFKPGDVDPSAFRVSFSGAPLFRARADNNTRQPITAPPT 
LVRHPSRKGYIVIVGTGKYFEDDDAQADTSRAMTLYGIWDRQTKGESANSTPTIDRNALTAQTMTTEANSTFGSVNRNIR 
LISQNPVKWYKDGATGTANSDVASYGWRLNLEVNSSKKGEMMIEDMFAAGQVLLLQTLTPNDDPCDSGSTSWTYGLNPYT 
GGRTSFTVFDLKRAGIVDSGSDYNGSWSAFQQDGLGGLAITQNEQRQSEACTGDECIIFNPSDKSNGRQTWRWEEK* 

>ORF37932c (SEQ ID NO:407) 

AGIAVGIRGLWPIEGAIRKGLVLWEIAGGDVPTVPDTVGHHDVIAGPAGICLSVFGGISCAVGLQEASGSVKSRMQRST 
AAGRAVRQPIEEIDFVLLVELEQGIVLAASAVGADATGVQGPPSETHVFRQAVKGQVGSGLCGQDAVAIVGVPYRKVAFL 

TGRKSIVGITGVGAAHRLCRAASAYLISIVISSPGIAAAIDATKPAGSPLRAIVIAQVGGGALGVAILPGCCEAWHWIVG 
NDDLIA* 
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>ORF38640c (SEQ ID NO:409) 

LTSLNIAPCERLKAVGSAIAGDVNITGS6SSWQQCA6APFL6F6RTGGSRLHLVTYKVARPVLVTGKAGLVAVADQLVA 
GSVILQRGSHCWFGGGGVALGNPCQDLIEStNQRVR* 

>ORF39309c (SEQ ID N0:411) 

SCLVIIASIAVAGNYCPLSAVQSGYGRFGEGVTKVRVGTAIVFPKQLDIVRLGYIECEQAATSSTKRSDQNSVPSSIEKG 
IADDRCSVDEILVGTALVADAGKLVKLFEYCCRDESELFHAGFDVETMQHAIVGSNINSRAALVCLCLECAWAAGFDGV 
GQISEVLGRSDHGWRRVHDVP* 

>ORF38768 (SEQ ID NO:413) 

GTSCTRLQPWSDRPNTSLIWPTPSNPAATTAHSRQRQTSAALEFMLDPTMACCM7STSKPAWKSSLSSLQQYSKSLTSLP 
ASATRAVPTNISSTLHRSSAMPFSMELGTLF* 

>ORF40047c (SEQ ID NO:415) 

KHLAGSEHIFDHHFAFLAAIDLQIQSPAIASHIRVRGTGCSVFVPLHRVLANKPNIPVHATECGVRLCCHGLGCEGVAVD 
GWGTVCAFALGLAIPDTIERHGSAGIGLSVIVLEVFSCTYDDDVALTARMAYQGRRSRDGLTSIWGASTEKRGAAERYS 
ESRRIYISGLEGIDRGICVWWVAGIADQIEAPDISLQIASIGWSNAIAWISQARRR* 

>ORF40560c (SEQ ID NO:417) 

PASPWGHGWASDMSAHRLLPDRRQDEHPRPRPHSRTYSWHSRQRLGKPELELQELWREDVRGSCSFLLDDPPGLSSVTL 
VAGVEDDALITSASLGLTLFVLGNGQAT* 

>ORF40238 (SEQ ID NO:419) 

VAWPLPRTNSVNPRLALVMSASSSTPATRVTDDKPGGSSRRNEHEPLTSSRHSSCSSSSGLPNLCLECHEYVRECGRGRG 
CSSCRRSGSSRWADISLAQPCPTTGLAGHILGTSGTDSVFLRQTHQRPARNRWLHYQAGPSRSLRIGAATMKSNRGFTL 
IELMIVWIIAIIAGIAYPSYDEmR(MTEGQALLSEmTQERYFSQNNTYITTQADIGKLHMROTSGTTVKSSTGK 
YSLTVDTVANDGGYRLIANQAFNDLDCGNLTLTANGEKGRTGSKKSVAECWR* 

>ORF40329 (SEQ ID NO:421) 

RTTNIAGRRGEMNMPLRLLATALAALALACPTFALSATiraFENVGWEDVHPAAGLVVVDGQTYRLPNRVQQQDS 
LVRQGQTVSFSGKLTSDLPEIESFYIIKQAPLVPFGSEQQQ* 
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>ORF40709c (SEQ ID NO:423) 

SLCSTSLLLLRSEGNER6LLDNVERLDFWQVAGEFAGERHCLSLTYQEYDRRVLLLDTVGQAICLPIDYYQTGGRMNILD 
HAHI LERIRGTQGKGWASQS * 

>ORF40507 (SEQ ID NO:425) 

SMGRHIACPTVSNNRTRRSYSWYVRDRQCLSPANSPATCQKSSRSTLSSRPLSFPSDRSSNNEVEQRLHSHRVDDRRSNH 
RYSCWYRLPQLRRIREARESHRRTGITQRSSRYSRALFFTEQYLYHYPSRHRQAAYAQHIGHHSEVLHRQIQPYRRYGSQ 
RRRLSPYR* 

>ORF41275c (SEQ ID NO:427) 

VGGVGRAGTGGTGGEHYLLDAQGTAGQGFVIGIFAHLKKPGLCRALPFSCLGALAPAFCNALLASSPAFLAVGGQGQVAT 
IKIVECLVSDKAITSWGYRIDGKAVFACGGLHCGARCVAHMQLADVGLGSDISIVL* 

>ORF42234c (SEQ ID NO:429) 

STSSRPEPSVMPFPSGEGGSKVHSSNHRVMETMQIKLMPRGFCAGVDRAIEIVNRALDVFGPPIYVRHEVVHNKFV 
VDNLRQRGAIFVEELDQVPDNVIVIFSMGVSQAVRKEAEGRGLKVFDATCPLVTKVHMEVWYSRDGHECVLIGHEGHP 
EVEGTMGQYDASNGGAIYLVEDEADVAALEVRKPEALHYVTQTTLSMDDTSKVIDALRAKFPQIQGPRKNDICYATQNRQ 
DAVKELADQCDMVLWGSPNSSNSNRLRELAERMGTPAYLIDGAEDMQRGWFDGVRRIGITAGASAPEVLVRGVIAQLRE 
WGASEEQELEGREENITFSMPKELRVKAL* 
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>ORF41764c (SEQ ID NO:431) 

RPPRGGRHHGPVRCQQRRCHLPGG6RGRRRRAGGAQARSPALRDPDHPVDGRHLEGHRCPARQVPADP6AAQERHLLCHP 
EPPGCREGTGRPVRHGPGGGQPQQFQLQPPARTRRAHGHAGLPDRRRRGHATRLVRRCASHRNHRRRLRAGSAGARSDRP 
AT* 

>ORF41284 (SEQ ID NO:433) 

LGDHSAHQHFRRGGACGDSDATHTVEPAAI^LGAVDQVGRRAHALGEFAQAVGVGTVGAAHHQDHVALVGQFLHGILAV 
LGGIADVVLARPLDLRELGAQGIDDLRGVVHRQGGLGHVVQGFGLAHLQRGDVGLVLHQVDGTAVAGIVLAHGAFHLGVA 
FMPDQHAFVAVAAVAHHFHVHLGHQRAGRVENLQAAPLGFLADRLGNAVGAEDDDDVVRHLIEFLDEDGAALAQVVHDEL 
VVHHLVTHVDRRAEDIEGTVDDLDGAIHAGAEAAGIGEFDLHGGLVGDAVIGRMNLATALPAWEGRSDRRFRPAGRR 

>ORF41598 (SEQ ID NO:435) 

PSRCRPSTGWSGSRSAGLRACAPPARRRRPRPPPGRWHRRCWHRTGPWCLPPRGGLHARSARIRGRRGCSAPLPCAPWSP 
AGRSRRKPSGRAPRLPCGPPGKRRGR* 

>ORF42172c (SEQ ID NO:437) 

QGSFVQSPRRPRDRHANQTRQSPRLLRRRGSRHRDRQPCPRCLRPADLRASRGGAQQVRRGQPAPARRHLRRGTRSGAGQ 
RHRHLQRPRRFPGGPQGSRGARPEGFRRDLPAGDQGAHGSGALQPRRPRMRADRA* 

>ORF42233c (SEQ ID NO:152) 

RRPAGLNRRSLRPSQAGRAVARFIRPITASPTRPPCKSNSPIPAASAPAWIAPSRSSTVPSMSSARRSTCVTRWCTTSSS 
WTTCASAAPSSSRNSIRCRTTSSSSSAPTAFPRRSARKPRGAA* 
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33A9 (SEQ ID NO:102) 

CAAAGCATAA6ACCAA6AT6GCACATT6CCAACAAAATACCCTTCCCCGCTACC6TT6TTTTATC6TTGTT6CCAGCCCT 
GATCTGGCGGAAAAGCCCGCTCCATGAATCGTCATGGAGCCTCCQTGTTTCAACTCCTTTCCTGGATATCCAGGAAGCC 
GTCCCCCACCCCMCAACCAAAGCTGCCCC^ 

GCCGCCGGCAGCTACTGGAGAACATCTGGCAGCGCGCCTCGCTATCCAAGCAGCAATTCGAGGAGATCTACCGGCGGCCA 
CTGGCCAACTATGCCGAGCTGGTCCAGCAGCTCCCTGCTTCGGAAAATCATCACCATGCCCATCCAGGCGGGATGATCGA 
TCACGGCCTGGAGATCGTGGCCTACGCACTCAAGGTACGGCAGACCTACCTGCTCCCGATCGGCGCAGCGCCGGAGTCAC 
AGTCAGCCCAGGCTGAAGCCTGGTCGGCCGCCGCGGCGTATGGCGCCCTGGCTCATGACATAGGCAAGATCGTCGTCGAC 
CTGCAGGTTGAGCTACAGGACGGCAGCACCTGGCACCCTTGGAACGGACCGATCAACCAGCCATACCGCTTCAAGTACGT 
GAAGTCCCGCGAATACCAGCTCCACGGCGCTGCCTCAGCACTTCTCATCCACCAACTGCTACCGCGCACTGCACTCGATT 
GGCTCAGTCGCTTTCCAGAGCTGTGGGCTCAATTGATCTACCTGTTCGCTGGGCAGTACGAGCACGCCGGGATCCTCGGC 
GAGATCATCGTGAAGGCAGACCAGGCCTCAGTTGCACAGGAGCTAGGAGGCAATCCGGATCGAGCTCTGGCTGCACCGAA 
GCAGTCGCTGCAGCGGCAGTTGGCAGACGGCCnCGCTTCTTGGTGAAGGACAAGTTCAAGTTGAATCAACCTAGCGGCC 
CGTCTGATGGATGGCTGACCCAGGACGCACTCTGGCTGGTGAGCAAGCCTGCTGCCGATCAACTGAGAGCCTACCTGCTG 
GCCCAGGGTATCGATGGGGTGCCCTCCTCTAACGCGCCGTTCTTCAGCATGCTCCAGGACCAAGCCGTCATCCAGACAAA 
TGCCGAGGACMGGCCATTTGGACGGC(^CGGTAGA(^CGGTGCTGGATGGAGAAACAAGTTCACGCTACTCAAGATTG 
CTCCAGCCTTGATCTGGACAGATGCTGCCGAGCGCCCCTCACCCTACAGCGGATCACTGGTCGTTGAAGATGGAACCGCC 
TCAACGGAAAAGCCGGAAACGACCTGTGAAATTCCCAACGGGCCGGCTGAACAGCAGCAAGCACCAGAAACGAAGATGAT 
GCTCCATCAACCTGCGCCGAGCGTTGCGAAACCGGCAAACGAGACGCAGGCGATTGCGAAACCCTCAACTGATGATCAAG 
AAGAAACAGACGATTTGTATGCACTTCTTGGTAA^ 

GCTGCCTCTCCTACGAACACACGCGGGGAGGAGAACCTACAGCAGCCACTAGGGACCAAGGAGCCMCAGATTGCGCTCC 
TGAAGCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAAATCTGGCA 
TCGCGGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGCATACCGTAGACGGGACCGCCATGCTGGTCACGCCA 
GGAATTTTCMGCGCTATGTCCAAGAGCATCCGGTGCTK 

GGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACCATCAAGGTTTCTG 
GTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCAGCCTCTGGACAAC 
CCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGACGCCGCAGCAGCTCACCGAGGAGTACATCTTCGCG 
CACGATCTCCGGGAAGCCAGC 



Fig. 5 
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33A9 SEQ ID N0:103 

1 MNRHGASHVS TPFLDIQEAV PHPNNQSCPR GIHPSSEQHG TARHASPPAA 
51 TGEHLAARLA IQAAIRGDLP AATGQLCGVG PAxPCFGKSS PCPSRRDDRS 
101 RPGDRGYALK VRQTYLLPIG AAPESQSAQA EAWSAAAAYG ALAHDIGKIV 
151 VDLQVELQDG STWHPWNGPI NQPYRPKYVK SREYQLHGAA SALFIHQLLP 
201 RTALDWLSRF PELWAQLIYL FAGQYEHAGI LGEIIVKADQ ASVAQELGGN 
251 PDRALAAPKQ SLQRQLADGL RFLVKDKFKL NQPSGPSDGW LTQDALWLVS 
301 KPAADQLRAY LLAQGIDGVP SSNAPFFSML QDQAVIQTNA EDKAIWTATV 
351 DNGAGWRNKF TLLKIAPALI WTDAAERPSP YSGSLWEDG TASTEKPETT 
401 CEIPNGPAEQ QQAPETKMML HQPAPSVAKP ANETQAIAKP STDDQEETDD 
451 LYALLGNINS PLEELDTSHD SPAASPTNTR GEENLQQPLG TKEPTDCAPE 
501 AIEDVFMPSR STDLGQGFVG WMKSGIAARR LFINDTKALV HTVDGTAMLV 
551 TPGIFKRYVQ EHPVLEKLAQ AKETTGWKLV QRAFEKQGLH RKTSKNLNIW 
601 TIKVSGPRKT KELKAYLLQD PKLLFPEQPL DNPSLTVITD AE* 
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33A9-ORF1 SEQ ID NO: 189 

AT6GA6CCTCCCATGTTTCAACTCCTTTCCTG6ATATCCA6GAAGCCGTCCCCCACCCCAACAACCAAAGCT6CCCCAGG 
GGGATTCATCCTTCCTCTGAGCAGCATGGAACTGCTCGGCACGCCTCGCCGCCGGCAGCTACTGGAGAACATCTGGCAGC 
GCGCCTCGCTATCCAAGCAGCAATTCGAGGAGATCTACCGGCGGCCACTGGCCAACTATGCCGAGCTGGTCCAGCAGCTC 
CCTGCTTCGGAAAATCATCACCATGCCCATCCAGGCGGGATGATCGATCACGGCCTGGAGATCGTGGCCTACGCACTCAA 
GGTACGGCAGACCTACCTGCTCCCGATCGGCGCAGCGCCGGAGTCACAGTCAGCCCAGGCTGAAGCCTGGTCGGCCGCCG 
CGGCGTATGGCGCCCTGGCTCATGACATAGGCAAGATCGTCGTCGACCTGCAGGTTGAGCTACAGGACGGCAGCACCTGG 
CACCCTTGGAACGGACCGATCAACCAGCCATACCGCTTCAAGTACGTGAAGTCCCGCGAATACCAGCTCCACGGCGCTGC 
CTCAGCACTTCTCATCCACCAACTGCTACCGCGCACTGCACTCGATTGGCTCAGTCGCTTTCCAGAGCTGTGGGCTCAAT 
TGATCTACCTGTTCGCTGGGCAGTACGAGCACGCCGGGATCCTCGGCGAGATCATCGTGAAGGCAGACCAGGCCTCAGTT 
GCACAGGAGCTAGGAGGCAATCCGGATCGAGCTCTGGCTGCACCGAAGCAGTCGCTGCAGCGGCAGTTGGCAGACGGCCT 
TCGCTTCTTGGTGAAGGACAAGTTCAAGTTGAATCAACCTAGCGGCCCGTCTGATGGATGGCTGACCCAGGACGCACTCT 
GGCTGGTGAGCAAGCCTGCTGCCGATCAACTGAGAGCCTACCTGCTGGCCCAGGGTATCGATGGGGTGCCCTCCTCTAAC 
GCGCCGTTCTTCAGCATGCTCCAGGACCAAGCCGTCATCCAGACAAATGCCGAGGACAAGGCCATTTGGACGGCCACGGT 
AGACAACGGTGCTGGATGGAGAAACAAGTTCACGCTACTCAAGATTGCTCCAGCCTTGATCTGGACAGATGCTGCCGAGC 
GCCCCTCACCCTACAGCGGATCACTGGTCGTTGAAGATGGAACCGCCTCAACGGAAAAGCCGGAAACGACCTGTGAAATT 
CCCAACGGGCCGGCTGAACAGCAGCAAGCACCAGAAACGAAGATGATGCTCCATCAACCTGCGCCGAGCGTTGCGAAACC 
GGCAAACGAGACGCAGGCGATTGCGAAACCCTCAACTGATGATCAAGAAGAAACAGACGATTTGTATGCACTTCTTGGTA 
ATATCAATTCGCCACTAGAAGAGCTAGACACTAGCCACGACTCGCCGGCTGCCTCTCCTACGAACACACGCGGGGAGGAG 
AACCTACAGCAGCCACTAGGGACCAAGGAGCCAACAGATTGCGCTCCTGAAGCAATTGAAGATGTATTTATGCCTAGCAG 
AAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAAATCTGGCATCGCGGCCCGTCGCCTGTTCATCAACGACACCA 
AGGCTTTGGTGCATACCGTAGACGGGACCGCCATGCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCG 
GTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGACCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCA 
TCGGAAGACCAGTAAAAACCTGAACATCTGGACCATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACC 
TGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGA 
GGTGTGGAATGA 
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33A9--ORF2 SEQ ID NO:190 

ATGTTTCAACTCCTTTCCTGGATATCCAGGAA6CCGTCCCCCACCCCAACAACCAAA6CTGCCCCAG66GGATTCATCCT 
TCCTCTGAGCAGCATGGAACTGCTCGGCACGCCTCGCCGCCGGCAGCTACTGGAGAACATCTGGCAGCGCGCCTCGCTAT 
CCAAGCAGCAATTCGAGGAGATCTACCGGCGGCCACTGGCCAACTATGCCGAGCTGGTCCAGCAGCTCCCTGCTTCGGAA 
AATCATCACCATGCCCATCCAGGCGGGATGATCGATCACGGCCTGGAGATCGTGGCCTACGCACTCAAGGTACGGCAGAC 
CTACCTGCTCCCGATCGGCGCAGCGCCGGAGTCACAGTCAGCCCAGGCTGAAGCCTGGTCGGCCGCCGCGGCGTATGGCG 
CCCTGGCTCATGACATAGGCAAGATCGTCGTCGACCTGCAGGTTGAGCTACAGGACGGCAGCACCTGGCACCCTTGGAAC 
GGACCGATCAACCAGCCATACCGCTTCAAGTACGTGAAGTCCCGCGAATACCAGCTCCACGGCGCTGCCTCAGCACTTCT 
CATCCACCAACTGCTACCGCGCACTGCACTCGATTGGCTCAGTCGCTTTCCAGAGCTGTGGGCTCAATTGATCTACCTGT 
TCGCTGGGCAGTACGAGCACGCCGGGATCCTCGGCGAGATCATCGTGAAGGCAGACCAGGCCTCAGTTGCACAGGAGCTA 
GGAGGCAATCCGGATCGAGCTCTGGCTGCACCGAAGCAGTCGCTGCAGCGGCAGTTGGCAGACGGCCTTCGCTTCTTGGT 
GAAGGACAAGITCAAGTTGAATCAACCTAGCGGCCCGTCTGATGGATGGCTGACCCAGGACGCACTCTGGCTGGTGAGCA 
AGCCTGCTGCCGATCAACTGAGAGCCTACCTGCTGGCCCAGGGTATCGATGGGGTGCCCTCCTCTAACGCGCCGTTCTTC 
AGCATGCTCCAGGACCAAGCCGTCATCCAGACAAATGCCGAGGACAAGGCCATTTGGACGGCCACGGTAGACAACGGTGC 
TGGATGGAGAAACAAGTTCACGCTACTCAAGATTGCTCCAGCCTTGATCTGGACAGATGCTGCCGAGCGCCCCTCACCCT 
ACAGCGGATCACTGGTCGTTGAAGATGGAACCGCCTCAACGGAAAAGCCGGAAACGACCTGTGAAATTCCCAACGGGCCG 
GCTGAACAGCAGCAAGCACCAGAAACGAAGATGATGCTCCATCAACCTGCGCCGAGCGTTGCGAAACCGGCAAACGAGAC 
GCAGGCGATTGCGAAACCCTCAACTGATGATCAAGAAGAAACAGACGATTTGTATGCACTTCTTGGTAATATCAATTCGC 
CACTAGAAGAGCTAGACACTAGCCACGACTCGCCGGCTGCCTCTCCTACGAACACACGCGGGGAGGAGAACCTACAGCAG 
CCACTAGGGACCAAGGAGCCAACAGATTGCGCTCCTGAAGCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCT 
GGGACAGGGATTCGTTGGTTGGATGAAATCTGGCATCGCGGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGC 
ATACCGTAGACGGGACCGCCATGCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAA 
CTGGCCCAAGCCAAGGAGACGACCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAG 
TAAAAACCTGAACATCTGGACCATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATC 
CCAAATTGCTGTTCCCTGAGCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 



Fig. 6C 



80/133 

ORF3— 33A9 SEQ ID NO:191 

ATGGAACTGCTCGGCAC6CCTC6CC6CC66CAGCTACTGGAGAACATCT6GCAGCGCGCCTC6CTATCCAAGCAGCAATT 
CGAGGAGATCTACCGGCGGCCACTGGCCAACTATGCCGAGCTGGTCCAGCAGCTCCCTGCTTCGGAAAATCATCACCATG 
CCCATCCAGGCGGGATGATCGATCACGGCCTGGAGATCGTGGCCTACGCACTCAAGGTACGGCAGACCTACCTGCTCCCG 
ATCGGCGCAGCGCCGGAGTCACAGTCAGCCCAGGCTGAAGCCTGGTCGGCCGCCGCGGCGTATGGCGCCCTGGCTCATGA 
CATAGGCAAGATCGTCGTCGACCTGCAGGTTGAGCTACAGGACGGCAGCACCTGGCACCCTTGGAACGGACCGATCAACC 
AGCCATACCGCTTCAAGTACGTGAAGTCCCGCGAATACCAGCTCCACGGCGCTGCCTCAGCACTTCTCATCCACCAACTG 
CTACCGCGCACTGCACTCGATTGGCTCAGTCGCTTTCCAGAGCTGTGGGCTCAATTGATCTACCTGTTCGCTGGGCAGTA 
CGAGCACGCCGGGATCCTCGGCGAGATCATCGTGAAGGCAGACCAGGCCTCAGTTGCACAGGAGCTAGGAGGCAATCCGG 
ATCGAGCTCTGGCTGCACCGAAGCAGTCGCTGCAGCGGCAGTTGGCAGACGGCCTTCGCTTCTTGGTGAAGGACAAGTTC 
AAGTTGAATCAACCTAGCGGCCCGTCTGATGGATGGCTGACCCAGGACGCACTCTGGCTGGTGAGCAAGCCTGCTGCCGA 
TCAACTGAGAGCCTACCTGCTGGCCCAGGGTATCGATGGGGTGCCCTCCTCTAACGCGCCGTTCTTCAGCATGCTCCAGG 
ACCAAGCCGTCATCCAGACAAATGCCGAGGACAAGGCCATTTGGACGGCCACGGTAGACAACGGTGCTGGATGGAGAAAC 
AAGTTCACGCTACTCAAGATTGCTCCAGCCTTGATCTGGACAGATGCTGCCGAGCGCCCCTCACCCTACAGCGGATCACT 
GGTCGTTGAAGATGGAACCGCCTCAACGGAAAAGCCGGAAACGACCTGTGAAATTCCCAACGGGCCGGCTGAACAGCAGC 
AAGCACCAGAAACGAAGATGATGCTCCATCAACCTGCGCCGAGCGTTGCGAAACCGGCAAACGAGACGCAGGCGATTGCG 
AAACCCTCAACTGATGATCAAGAAGAAACAGACGATTTGTATGCACTTCTTGGTAATATCAATTCGCCACTAGAAGAGCT 
AGACACTAGCCACGACTCGCCGGCTGCCTCTCCTACGAACACACGCGGGGAGGAGAACCTACAGCAGCCACTAGGGACCA 
AGGAGCCAACAGATTGCGCTCCTGAAGCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCTGGGACAGGGATTC 
GTTGGTTGGATGAAATCTGGCATCGCGGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGCATACCGTAGACGG 
GACCGCCATGCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCA 
AGGAGACGACCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAAC 
ATCTGGACCATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTT 
CCCTGAGCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 
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ORF4-33A9 SEQID NO:192 

AT6ATCGATCACGGCCT6GAGATC6T66CCTACGCACTCAAGGTAC6GCAGACCTACCTGCTCCCGATCGGCGCAGCGCC 
GGAGTCACAGTCAGCCCAGGCTGAAGCCTGGTCGGCCGCCGCGGCGTATGGCGCCCTGGCTCATGACATAGGCAAGATCG 
TCGTCGACCTGCAGGTTGAGCTACAGGACGGCAGCACCTGGCACCCTTGGAACGGACCGATCAACCAGCCATACCGCTTC 
AAGTACGTGAAGTCCCGCGAATACCAGCTCCACGGCGCTGCCTCAGCACTTCTCATCCACCAACTGCTACCGCGCACTGC 
ACTCGATTGGCTCAGTCGCTTTCCAGAGCTGTGGGCTCAATTGATCTACCTGTTCGCTGGGCAGTACGAGCACGCCGGGA 
TCCTCGGCGAGATCATCGTGAAGGCAGACCAGGCCTCAGTTGCACAGGAGCTAGGAGGCAATCCGGATCGAGCTCTGGCT 
GCACCGAAGCAGTCGCTGCAGCGGCAGTTGGCAGACGGCCTTCGCTTCTTGGTGAAGGACAAGTTCAAGTTGAATCAACC 
TAGCGGCCCGTCTGATGGATGGCTGACCCAGGACGCACTCTGGCTGGTGAGCAAGCCTGCTGCCGATCAACTGAGAGCCT 
ACCTGCTGGCCCAGGGTATCGATGGGGTGCCCTCCTCTAACGCGCCGTTCTTCAGCATGCTCCAGGACCAAGCCGTCATC 
CAGACAAATGCCGAGGACAAGGCCATTTGGACGGCCACGGTAGACAACGGTGCTGGATGGAGAAACAAGTTCACGCTACT 
CAAGATTGCTCCAGCCTTGATCTGGACAGATGCTGCCGAGCGCCCCTCACCCTACAGCGGATCACTGGTCGTTGAAGATG 
GAACCGCCTCAACGGAAAAGCCGGAAACGACCTGTGAAATTCCCAACGGGCCGGCTGAACAGCAGCAAGCACCAGAAACG 
AAGATGATGCTCCATCAACCTGCGCCGAGCGTTGCGAAACCGGCAAACGAGACGCAGGCGATTGCGAAACCCTCAACTGA 
TGATCAAGAAGAAACAGACGATTTGTATGCACTTCTTGGTAATATCAATTCGCCACTAGAAGAGCTAGACACTAGCCACG 
ACTCGCCGGCTGCCTCTCCTACGAACACACGCGGGGAGGAGAACCTACAGCAGCCACTAGGGACCAAGGAGCCAACAGAT 
TGCGCTCCTGAAGCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAA 
ATCTGGCATCGCGGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGCATACCGTAGACGGGACCGCCATGCTGG 
TCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGACCGGC 
TGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACCATCAA 
GGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCAGCCTC 
TGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 

Fig. 6E 

ORF5-33A9 SEQ ID NO:193 

ATGCTCCAGGACCAAGCCGTCATCCAGACAAATGCCGAGGACAAGGCCATTTGGACGGCCACGGTAGACAACGGTGCTGG 
ATGGAGAAACAAGTTCACGCTACTCAAGATTGCTCCAGCCTTGATCTGGACAGATGCTGCCGAGCGCCCCTCACCCTACA 
GCGGATCACTGGTCGTTGAAGATGGAACCGCCTCAACGGAAAAGCCGGAAACGACCTGTGAAATTCCCAACGGGCCGGCT 
GAACAGCAGCAAGCACCAGAAACGAAGATGATGCTCCATCAACCTGCGCCGAGCGTTGCGAAACCGGCAAACGAGACGCA 
GGCGATTGCGAAACCCTCAACTGATGATCAAGAAGAAACAGACGATTTGTATGCACTTCTTGGTAATATCAATTCGCCAC 
TAGAAGAGCTAGACACTAGCCACGACTCGCCGGCTGCCTCTCCTACGAACACACGCGGGGAGGAGAACCTACAGCAGCCA 
CTAGGGACCAAGGAGCCAACAGATTGCGCTCCTGAAGCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCTGGG 
ACAGGGATTCGTTGGTTGGATGAAATCTGGCATCGCGGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGCATA 
CCGTAGACGGGACCGCCATGCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTG 
GCCCAAGCCAAGGAGACGACCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAA 
AAACCTGAACATCTGGACCATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCA 
AATTGCTGTTCCCTGAGCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 



Fig. 6F 
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ORF6-33A9 SEQ ID NO:194 

AT6ATGCTCCATCAACCT6CGCC6AGCGTT6C6AAACCG6CAAACGA6AC6CAGGCGATT6CGAAACCCTCAACTGATGA 
TCAAGAAGAAACAGACGATTTGTATGCACTTCTTGGTAATATCAATTCGCCACTAGAAGAGCTAGACACTAGCCACGACT 
CGCCGGCTGCCTCTCCTACGAACACACGCGGGGAGGAGAACCTACAGCAGCCACTAGGGACCAAGGAGCCAACAGATTGC 
GCTCCTGAAGCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAAATC 
TGGCATCGCGGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGCATACCGTAGACGGGACCGCCATGCTGGTCA 
CGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGACCGGCTGG 
AAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACCATCAAGGT 
TTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCAGCCTCTGG 
ACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 

Fig. 6G 

ORF7--33A9 SEQ ID NO:195 

ATGCTCCATCAACCTGCGCCGAGCGTTGCGAAACCGGCAAACGAGACGCAGGCGATTGCGAAACCCTCAACTGATGATCA 
AGAAGAAACAGACGATTTGTATGCACTTCTTGGTAATATCAATTCGCCACTAGAAGAGCTAGACACTAGCCACGACTCGC 
CGGCTGCCTCTCCTACGAACACACGCGGGGAGGAGAACCTACAGCAGCCACTAGGGACCAAGGAGCCAACAGATTGCGCT 
CCTGAAGCAATTGAAGATGTATTTATGCCTAGCAGAAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAAATCTGG 
CATCGCGGCCCGTCGCCTGTTCATCAACGACACCAAGGCTTTGGTGCATACCGTAGACGGGACCGCCATGCTGGTCACGC 
CAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGACCGGCTGGAAG 
CTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACCATCAAGGTTTC 
TGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCAGCCTCTGGACA 
ACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 

Fig. 6H 

ORF8-33A9 SEQ ID NO:196 

ATGCCTAGCAGAAGTACTGATCTGGGACAGGGATTCGTTGGTTGGATGAAATCTGGCATCGCGGCCCGTCGCCTGTTCAT 
CAACGACACCAAGGCTTTGGTGCATACCGTAGACGGGACCGCCATGCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCC 
AAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGACCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAA 
CAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACCATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCT 
CAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCG 
ATGCCGAAGGAGGTGTGGAATGA 



Fig. 61 
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ORF9-33A9 SEQ ID NO:197 

AT6AAATCTG6CATCGC66CCC6TCGCCT6TTCATCAACGACACCAA66CTTT66T6CATACC6TAGAC6GGACCGCCAT 
GCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGACGA 
CCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGACC 
ATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAGCA 
GCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 

Fig. 6J 

ORF10-33A9 SEQ ID NO:198 

ATGCTGGTCACGCCAGGAATTTTCAAGCGCTATGTCCAAGAGCATCCGGTGCTTGAAAAACTGGCCCAAGCCAAGGAGAC 
GACCGGCTGGAAGCTGGTGCAGCGCGCGTTCGAAAAACAGGGGCTTCATCGGAAGACCAGTAAAAACCTGAACATCTGGA 
CCATCAAGGTTTCTGGTCCTCGCAAGACGAAAGAGCTCAAGGCCTACCTGCTCCAGGATCCCAAATTGCTGTTCCCTGAG 
CAGCCTCTGGACAACCCAAGCCTCACGGTCATCACCGATGCCGAAGGAGGTGTGGAATGA 

Fig. 6K 

ORF 1-33A9 SEQ ID NO:199 

MEPPMFQLLSWISRKPSPTPTTKAAPGGFILPLSSMELLGTPRRRQLLENIWQRASLSKQQFEEIYRRPLANYAELVQQL 
PASENHHHAHPGGMIDHGLEIVAYALKVRQTYLLPIGAAPESQSAQAEAWSAAAAYGALAHDIGKIWDLQVELQDGSTW 
HPWNGPINQPYRPKYVKSREYQLHGAASALLIHQLLPRTALDWLSRFPELWAQLIYLFAGQYEHAGILGEIIVKADQASV 
AQELGGNPDRALAAPKQSLQRQLADGLRFLVKDKFKLNQPSGPSDGWLTQDALWLVSKPAADQLRAYLLAQGIDGVPSSN 
APFFSMLQDQAVIQTNAEDKAIWTATVDNGAGWRNKFTLLKIAPALIWTDAAERPSPYSGSLWEDGTASTEKPETTCEI 
PNGPAEQQQAPETKMMLHQPAPSVAKPANETQAIAKPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRGEE 
NLQQPLGTKEPTDCAPEAIEDVFMPSRSTDLGQGFVGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHP 
VLEKLAQAKETTGWKLVQRAFEKQGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEG 
GVE. 

Fig. 6L 



ORF2-33A9 SEQ ID NO:200 

MFQLLSWISRKPSPTPTTKAAPGGFILPLSSMELLGTPRRRQLLENIWQRASLSKQQFEEIYRRPLANYAELVQQLPASE 
NHHHAHPGGMIDHGLEIVAYALKVRQTYLLPIGAAPESQSAQAEAWSAAAAYGALAHDIGKIWDLQVELQDGSTWHPWN 
GPINQPYRFKYVKSREYQLHGAASALLIHQLLPRTALDWLSRFPELWAQLIYLFAGQYEHAGILGEIIVKADQASVAQEL 
GGNPDRALAAPKQSLQRQLADGLRFLVKDKFKLNQPSGPSDGWLTQDALWLVSKPAADQLRAYLLAQGIDGVPSSNAPFF 
SMLQDQAVI QTNAEDKAIWT ATVDNGAGWRNKFTLLKIAPAL IWTDAAERPS P YSGSLWEDGTAS TEKPETTCEI PNGP 
AEQQQAPETKMMLHQPAPSVAKPANETQAIAKPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRGEENLQQ 
PLGTKEPTDCAPEAIEDVFMPSRSTDLGQGFVGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEK 
LAQAKETTGWKLVQRAFEKQGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 

Fig. 6M 
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ORF2-33A9 SEQ ID NO:201 

MELLGTPRRRQLLENIWQRASLSKQQFEEIYRRPLANYAELVQQLPASENHHHAHPGGMIDHGLEIVAYALKVRQTYLLP 
IGAAPESQSAQAEAWSAAAAYGALAHDIGKIWDLQVELQDGSTWHPWNGPINQPYRFKYVKSREYQLHGAASALLIHQL 
LPRTALDWLSRFPELWAQLIYLFAGQYEHAGILGEIIVKADQASVAQELGGNPDRALAAPKQSLQRQLADGLRFLVKDKF 
KLNQPSGPSDGWLTQDALWLVSKPAADQLRAYLLAQGIDGVPSSNAPFFSMLQDQAVIQTNAEDKAIWTATVDNGAGWRN 
KFTLLKIAPALIWTDAAERPSPYSGSLWEDGTASTEKPETTCEIPNGPAEQQQAPETKMMLHQPAPSVAKPANETQAIA 
KPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRGEENLQQPLGTKEPTDCAPEAIEDVFMPSRSTDLGQGF 
VGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEKLAQAKETTGWKLVQRAFEKQGLHRKTSKNLN 
IWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 

Fig. 6N 

ORF4-33A9 SEQ ID NO:202 

MIDHGLEIVAYALKVRQTYLLPIGAAPESQSAQAEAWSAAAAYGALAHDIGKIWDLQVELQDGSTWHPWNGPINQPYRF 
KYVKSREYQLHGAASALLIHQLLPRTALDWLSRFPELWAQLIYLFAGQYEHAGILGEIIVKADQASVAQELGGNPDRALA 
APKQSLQRQLADGLRFLVKDKFKLNQPSGPSDGWLTQDALWLVSKPAADQLRAYLLAQGIDGVPSSNAPFFSMLQDQAVI 
QTNAEDKAIWTATVDNGAGWRNKFTLLKIAPALIWTDAAERPSPYSGSLWEDGTASTEKPETTCEIPNGPAEQQQAPET 
KMMLHQPAPSVAKPANETQAIAKPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRGEENLQQPLGTKEPTD 
CAPEAIEDVFMPSRSTDLGQGFVGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEKLAQAKETTG 
WKLVQRAFEKQGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 

Fig. 60 

ORF5-33A9 SEQ ID NO:203 

MLQDQAVIQTNAEDKAIWTATVDNGAGWRNKFTLLKIAPALIWTDAAERPSPYSGSLWEDGTASTEKPETTCEIPNGPA 
EQQQAPETKMMLHQPAPSVAKPANETQAIAKPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRGEENLQQP 
LGTKEPTDCAPEAIEDVFMPSRSTDLGQGFVGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEKL 
AQAKETTGWKLVQRAFEKQGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 



Fig. 6P 
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ORF6-33A9 SEQ ID NO:204 

MMLHQPAPSVAKPANETQAIAKPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRGEENLQQPLGTKEPTDC 
APEAIEDVFMPSRSTDLGQGFVGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEKLAQAKETTGW 
KLVQRAFEKQGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 

Fig. 6Q 

ORF7-33A9 SEQ ID NO:205 

MLHQPAPSVAKPANETQAIAKPSTDDQEETDDLYALLGNINSPLEELDTSHDSPAASPTNTRGEENLQQPLGTKEPTDCA 
PEAIEDVFMPSRSTDLGQGFVGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEKLAQAKETTGWK 
LVQRAFEKQGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 

Fig. 6R 

ORF8-33A9 SEQ ID NO:206 

MPSRSTDLGQGFVGWMKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEKLAQAKETTGWKLVQRAFEK 
QGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 

Fig. 6S 

ORF9--33A9 SEQ ID NO:207 

MKSGIAARRLFINDTKALVHTVDGTAMLVTPGIFKRYVQEHPVLEKLAQAKETTGWKLVQRAFEKQGLHRKTSKNLNIWT 
IKVSGPRKTKELKAYLLQDPKLLFPEQPLDNPSLTVITDAEGGVE. 

Fig. 6T 

ORF10--33A9 SEQ ID NO:208 

MLVTPGIFKRYVQEHPVLEKLAQAKETTGWKLVQRAFEKQGLHRKTSKNLNIWTIKVSGPRKTKELKAYLLQDPKLLFPE 
QPLDNPSLTVITDAEGGVE. 

Fig. 6U 
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Sequence: 34B12 EcoRl fragment From: 1 To: 4590 

10 20 30 40 50 60 

i i i i I i i i i 1 i i i i I i i i i 1 i i i i I i i i i I i i i [ I i i i i I i i i i I i i i i I i i i i I i i i i 1 

GAATTCCAT6 GCGCCGTGGA GGAGGCTTCC GAGTCGCCGG TGGCAGGCGT ACGGGCCGGC 60 
AACTACCAGG TCGACCTGGA CGATGCGAGC TTTGCCCGCC AGGTAGAACG CCTGCAGGCC 120 
CACGTGAGGG CCGGCGACGT GTTCCAGATC GTACCTTCGC GCAGCTTCAG CATGCCGTGC 180 
GCGGACCCCT GGCGGGCCTA TCGCCAGTTG TGCCTGCGCA ACCCCAGCCC GTACCGCTTC 240 
TTCCTCGATG CGGGGGACTT CTGCCTGTTC GGCGCTTCGC CGGAGTCGGC ATTGAAGTAC 300 

310 320 330 340 350 360 
i i i i I i i i i I 1 1 i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i 1 1 

GACGCGGAGA GTCGCGAGGT GGAACTCTAT CCCATTGCCG GCACCCGCCC GCGCGGATGC 360 
GATGCCCGGG GCGCCATCGA TGCGGAACTG GACAATCGCC TGGAAGCGGA GTTGCGCCTG 420 
GATGCCAAGG AGATCGCCGA GCACATGATG CTGGTCGACC TGGCGCGCAA CGATCTGGCG 480 
CGCGTCTGCC GCAGCGGTAC CCGGCAGGTG CGCGACATGC TCAAGGTCGA TCGCTACAGC 540 
CACGTGATGC ACCTGGTCTC GCGCGTGGCT GGCGAACTGC ACGGCGAACT GGATGCGCTG 600 

610 620 630 640 650 660 

i i i i I i ii i I i i i i I i i i i 1 i i i i I i i i i I i i i i I i i i i I i i i i I i i i i 1 i i i i I i i i i I 

CATGCCTACC GTGCCTGCCT GAACATGGGC ACCCTGGTCG GCGCGCCGAA GGTCCGTGCC 660 
ATGCAGTTGC TGCGGCAGTA CGAGGATGGC TATCGCGGCA GCTACGGTGG TGCGATCGGC 720 
ATTCTCGACA GCGCCGGCAA CCTCGATACC AGCATTGTCA TCCGCTCCGC CGAGGTCCGC 780 
GAAGGTATCG CGCGGGTTCG GGCAGGCGCC GGCGTGGTGC TGGATTCGGA TCCACGGCTG 840 
GAGGCCGAGG AAACCCGCAA CAAGGCGCTG GCGGTGCTGA CCGCCGTGGC CGCTGCCGAA 900 

910 920 930 940 950 960 

i i i i 1 i i i i I i i i i I i i i i I i i i i I i i i i i i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I 

CGCGAAAGGG GAGAGCGCGA TGCGCATCAC GCTGTTGGAT AACTTCGATT CCTTCACCTA 960 
CAACCTGGTC GAGCAGTTCT GCCTGCTCGG CGCGGAGGTC CGGGTGATGC GCAACGATAC 1020 
GCCGTTGCCG ACGATCCAGG CGGCATTGCT GGCCGACGGT TGCGAACTGC TGGTGCTGTC 1080 
GCCGGGGCCC GGTCGGCCGG AAGACGCCGG CTGTATGCTG GAATTGCTCG CCTGGGCCCG 1140 
CGGGCGCTTG CCGGTGCTCG GCGTCTGCCT CGGCCACCAG GCGCTGGCGC TGGCCGCCGG 1200 

1210 1220 1230 1240 1250 1260 

i i ii I i i i i I i i i i 1 1 i i i I i i i i I i i i i I i i i i I i i i i I i i i i 1 i i i i I i i i i I i i i i I 

TGGCGCGGTG GGCGAGGCGA GGAAGCCGCT GCATGGCAAG AGCACGTCCC TGCGTTTCGA 1260 
TCAGCGTCAC CCGCTGTTCG ACGGCATCGC TGACCTGCGC GTCGCGCGCT ACCACTCGCT 1320 
GGTGGTCAGT CGCCTGCCGG AAGGTTTCGA CTGCCTGGCC GATGCCGATG GCGAGATCAT 1380 
GGCGATGGCC GATCCGCGCA ATCGACAGCT GGGCTTGCAA TTCCATCCCG AGTCGATTCT 1440 
CACCACCCAC GGCCAGCGTC TGCTGGAGAA CGCTCTACTC TGGTGCGGCG CGTTGGCGGT 1500 



Fig. 7B 
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Sequence: 34B12 EcoRl fragment From: 1 To: 4590 

1510 1520 1530 1540 1550 1560 

i i i i I i i i i I i i i i I i i i 1 1 i i i i 1 1 i 1 1 I 1 1 1 i 1 1 1 1 1 1 M i 1 1 1 1 1 i I 1 1 1 1 1 1 i i i I 

CGCG6AGCGC CTTCGGGCCT GAGCGGCGCT GCGCAGTTTC GACCGAGGCT CGGTTGCCAG 1560 
GCCGGCGCAT CGTCGAAACG CTGGCGGCCC AGTTCGCGCA GGCGCTGGCG GGCGCTTTCG 1620 
AGAAAGCGAC GGAAGCTGCG CTCGGATTCC AGCGCGGTGT TGTAGTAGCA ATACACCTTG 1680 
GTGTCGATGC CGCCCGGTTC GTACAGTTCG CTGAGGACTG CCAGGGTACC GTTGCGCAGG 1740 
CGTTCCTCGA CGAAATAATG CGGCGAGATG CCCCATCCGA CGCCGGCTTC CACCAGACGC 1800 

1810 1820 1830 1840 1850 1860 

i i i i I 1 1 i i I i i i i i i i i i I i i i i I i i i i 1 i i i i I i i i i I i i i 1 1 i i i i I i i i i I i i i i I 

AGCATGTCGT CGAAGTTTTC CACGAAGAGC ACCTTGTCGC TGACCGGCCG CAGCAGGTTC 1860 
GAATGCTGCC CGGAGCGGCT GCCGAGGCTG ATCTGCCGGT AATTGGCCAG GCTCGCGATG 1920 
CTGTGCAGGG AGGCATTGCA CAACGGGTGC TGCGGATGGG CGACGACGAA CGCCTTGGTG 1980 
TAGCCGAGCA CGCACTGGTT GAAGCGGGAG ATCTTCAGTT CCTCGTCGAT GGTGATGGCG 2040 
ATATCGATTT CCGCGTTGTC CTGCTTGATC GTCGCCAGGC TATCGGCGGG CGAGGTGCGT 2100 

2110 2120 2130 2140 2150 2160 

i i i i I 1 1 i 1 1 1 1 i i I i i 1 1 I i i i i I i 1 1 i I i i i i I 1 1 1 i I i i i i 1 1 i i i I i i i i 1 1 i i i I 

ATCAGGCTGA CCATGTTGAA ATCGTCGAGC AGTACGCTGC TCACCGTATC GCAGAACGAC 2160 
GGCGGGATGG CGGTGTCCAG CAGCACCCGG AGATTGCGCG GACCCTTGTT GAGATTGAAG 2220 
GCGATGTCGC CGATCAGCTG CTGGTAGTTC AGCAGGCTGC GCATGTAAGG GATCAGGCGA 2280 
AGCGCCTGCT CGGTGGGTTC GACCTTGTAG CCGTCCCGAC GGACCAGCTC CACGCACAGG 2340 
TCGATTTCCA GGTTGCTGAC CGCCGAGCTG ACCGCGGTGT GCGACTTGCG CAGGATCCGC 2400 

2410 2420 2430 2440 2450 2460 

ii i i I i i i i I i i i i I i i i i 1 i i i i I i i i i I i i i i I i i i i 1 i i i i I i i i i 1 i i i i I i i i i I 

GCAGCGGAGG AAATCGAACC GGAGGCGATG ACCTGGAGGA ACATGTTCAC GTGATTCAGG 2460 
TTATGAATAG GCATCCCTTA TTCCTTTTAT TGGGTGGCGC GTGCCGCTTC CCTTGATCGG 2520 
GTCAGGTTGC CGCTACTGTG GAAGAAGCGT CGAGGACTCG ATAGATAGCG CCCGAGTGTT 2580 
TCAACTTGTC TTCTGGATGA CGTTTTCATC GGGGAAACCT CCCGTCGGTC AGTGAATCGC 2640 
AAGGGCTGGC GTGCGAGGGT GGAATCGGCC GCCGGCTCGC TTTCTGCGCG GCGGGCGCAC 2700 

2710 2720 2730 2740 2750 2760 

i i i i I i i i i I i i i i 1 i i i i I i i i i I i i i i 1 i i i i I i i i i I i i i i 1 i i i i I i i i i I i i i i I 

GGCACGGGGA GTCGTCGTTT TGGAGGTGAG GGATGACGGC TCTGTTTCAG GATATTTTTA 2760 
TAATTATGTG AAAGAAGAGC TTATTTCAAC GAAATATGTT TCATATTGCT CGTTAAATTC 2820 
GACGAAAAGA AAATCCGGAT ATTTACCGGT TATTTAACGC TAATACCAAG TGCCTAATAC 2880 
CAAAGTATTA ACGCTGGTAT GCCGGCATGT CGTGTTCGGT CGTGGAGCGA GCCGAGCTAG 2940 
GGACGGTTCT AATAAACCAA AAAATTATGT CGCGTACGTC TAACGACCGA AACCTATGTC 3000 



Fig. 7C 
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Sequence: 34B12 EcoRI fragment From: 1 To: 4590 

3010 3020 3030 3040 3050 3060 

i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i i 1 1 1 1 1 1 1 1 

TCTTGTTAGC GTAGCCACCG GCCAGGCCGG TACGGACCCG GGATGGCCCT GGCGCGACCT 3060 
ATGCGGTTAG AATCCGCGGC CTTGCAGGCG GATACCCGAG CTTCGCTCGA AGGTGTCGCG 3120 
GTGCCGTGCC GTGGAATCGG CCGCCGGCTC GCTTTCTGCG CGGCGGGCGC ACGGCGACGG 3180 
GGAGTCGTCG TTTTGGAGGT GAGGGATGAC GGCTCTGTTT CAGGATATTT TTATAATTAT 3240 
GTGAAAGAAG AGCTTATTTC AACGAAATAT GTTTCATATT GCTCGTAAAT TCGACGAAAA 3300 

3310 3320 3330 3340 3350 3360 

i i i i I i i i i I i i i i I i i i i I i i i i I i i i 1 1 i i i i I i i i i 1 i i i i I i i i i I i i i i I i i i i I 

GAAAATCCGG ATATTTACCG GTTATTTAAC GTTAATACCA AGGGCCTAAT ACCAAAGTAT 3360 
TAACGCTGGC ATGCCGGCAT GTCGTGTTCG GTCGTGGAGC GAGCCGAGCC AGGAACGGTT 3420 
CTAAGAAACG AAAAAATTAT GTCGCGTAGG TCTAACGACC GAAACCTATG TCTTTTGTTA 3480 
GCGTAGCCAC CGGCCAGGCC GGTACGGATG CCGGGATGGC CCTGGCGCGA CCTATGCGGT 3540 
TAGAATCCGC GGCCTTGCAG GCGGATCCCC GGGGTTTGCT CAAGGGGACA CGGGTGCCGT 3600 

3610 3620 3630 3640 3650 3660 
i i i i I i i i i I i i i i 1 i i i i I i i i i I i it i 1 i i i i I i i i i I i i i i I i i i 1 1 i i i i I i i i i I 

GCCCGAAACC TGCAATCGTC AGTTCCCTGC GGTCCAGCCT GCCGCCGGGT ATAAAATCGA 3660 

GAGACGCGCT GTTGCGCCTT CAGGTGTAGC GACTATGACG CACATTTCCG AACGACTCCT 3720 

GGTACAGGCC CACCTGGCCG CCAAGCAACC CCGTGTGTTG AGCGAGCAGG AGAGCGCCGA 3780 

GCATCGCGCG GCGATCGCGG CCGAACTGAA GGCGCAAAAT GCTGTACTGG TGGCGCATTA 3840 

CTACTGCGAC CCGGTGATCC AGGCGTTGGC CGAGGAGACC GGCGGTTGCG TATCCGATTC 3900 

3910 3920 3930 3940 3950 3960 

1 1 1 i 1 1 i 1 1 I i i i 1 1 i i 1 1 1 i i i i I 1 1 i 1 1 1 1 i i I i i i i 1 i i i i I i 1 1 i I i 1 1 1 1 1 i i 1 1 

GCTGGAGATG GCCCGTTTCG GCAACCAGCA TCCGGCGCAG ACGGTGGTCG TGGCCGGGGT 3960 
GCGCTTCATG GGCGAGACGG CGAAGATCCT CAACCCTGAG AAGCGTGTGC TGATGCCGAC 4020 
CCTCGAAGCG ACCTGCTCGC TCGACCTGGG ATGCCCGGTG GATGAATTCT CGGCTTTCTG 4080 
CGACCAGCAC CCGGAACGGA CCGTGGTGGT CTATGCGAAC ACCTCCGCGG CGGTGAAGGC 4140 
ACGCGCCGAC TGGGTCGTGA CCTCCAGTTG CGCGGTGGAG ATCGTCGAAC ACCTGATGGA 4200 

4210 4220 4230 4240 4250 4260 
i i i i I i i i i I i i i i I ii i i I 1 1 i i I i 1 1 i I i 1 1 i I i i i i I i i i i 1 i i i i i i i i i I i i i i I 

CAACGGCGAG CCCATCCTCT GGGCGCCGGA CCAGCACCTG GGACGCTACA TCCAGCGCGA 4260 

GACGGGGGCC GACATGCTGC TCTGGGATGG CGCCTGTATC GTCCACGAGG AGTTCAAGGC 4320 

CAAGCAGCTG GAAGACATGA AGGCGCTCTA CCCGGACGCC GCCATCCTGG TCCACCCCGA 4380 

ATCGCCGGAA AGCGTGGTCG CGCTGGCCGA TGCCGTGGGC TCGACCAGCC AGTTGATCAA 4440 

GGCCGCGCAG ACCCTGCCGA ACAAGACCTT CATCGTCGCC ACCGATCGCG GCATCTTCTA 4500 



Fig. 7D 
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Sequence: 34B12 EcoRl fragment From: 1 To: 4590 

4510 4520 4530 4540 4550 4560 

I I I I I I I I I I I I I I 1 I I I [ I I I I I 1 I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I 1 

CAAGATGCAG CAGTTGTGCC CGGACAAGGA TTTCATCGAG GCCCCCACCG CCGGCAACGG 4560 
CGCCGCCTGC CGCAGTGCGC GCACTGCCCG 4590 



Fig. 7E 
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Sequence: 34B12 ORF 1 L-S From: 1 To: 1284 

10 20 30 40 50 60 

i i i 1 1 i i i i I 1 1 1 1 1 1 1 1 1 I 1 1 i i I i 1 1 i I 1 1 i i I 1 1 1 1 1 1 1 1 1 1 i i 1 1 1 1 1 ii 1 1 i i 1 1 

ATGGCAAGAG CACGTCCCTG CGTTTCGATC AGCGTCACCC GCTGTTCGAC GGCATCGCTG 60 
ACCTGCGCGT CGCGCGCTAC CACTCGCTGG TGGTCAGTCG CCTGCCGGAA GGTTTCGACT 120 
GCCTGGCCGA TGCCGATGGC GAGATCATGG CGATGGCCGA TCCGCGCAAT CGACAGCTGG 180 
GCTTGCAATT CCATCCCGAG TCGATTCTCA CCACCCACGG CCAGCGTCTG CTGGAGAACG 240 
CTCTACTCTG GTGCGGCGCG TTGGCGGTCG CGGAGCGCCT TCGGGCCTGA GCGGCGCTGC 300 
310 320 330 340 350 360 

I I I I I I I I I I I I M 1 I I I I I I I I I I i i i i 1 i i i i I i i i i | ii I i 1 i i i i I i i i i I i i i i I 

GCAGTTTCGA CCGAGGCTCG GTTGCCAGGC CGGCGCATCG TCGAAACGCT GGCGGCCCAG 360 
TTCGCGCAGG CGCTGGCGGG CGCTTTCGAG AAAGCGACGG AAGCTGCGCT CGGATTCCAG 420 
CGCGGTGTTG TAGTAGCAAT ACACCTTGGT GTCGATGCCG CCCGGTTCGT ACAGTTCGCT 480 
GAGGACTGCC AGGGTACCGT TGCGCAGGCG TTCCTCGACG AAATAATGCG GCGAGATGCC 540 
CCATCCGACG CCGGCTTCCA CCAGACGCAG CATGTCGTCG AAGTTTTCCA CGAAGAGCAC 600 

610 620 630 640 650 660 

i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I 

CTTGTCGCTG ACCGGCCGCA GCAGGTTCGA ATGCTGCCCG GAGCGGCTGC CGAGGCTGAT 660 
CTGCCGGTAA TTGGCCAGGC TCGCGATGCT GTGCAGGGAG GCATTGCACA ACGGGTGCTG 720 
CGGATGGGCG ACGACGAACG CCTTGGTGTA GCCGAGCACG CACTGGTTGA AGCGGGAGAT 780 
CTTCAGTTCC TCGTCGATGG TGATGGCGAT ATCGATTTCC GCGTTGTCCT GCTTGATCGT 840 
CGCCAGGCTA TCGGCGGGCG AGGTGCGTAT CAGGCTGACC ATGTTGAAAT CGTCGAGCAG 900 

910 920 930 940 950 960 

i i i i I i ii i I i i i i 1 i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i ii I i i i i I 

TACGCTGCTC ACCGTATCGC AGAACGACGG CGGGATGGCG GTGTCCAGCA GCACCCGGAG 960 
ATTGCGCGGA CCCTTGTTGA GATTGAAGGC GATGTCGCCG ATCAGCTGCT GGTAGTTCAG 1020 
CAGGCTGCGC ATGTAAGGGA TCAGGCGAAG CGCCTGCTCG GTGGGTTCGA CCTTGTAGCC 1080 
GTCCCGACGG ACCAGCTCCA CGCACAGGTC GATTTCCAGG TTGCTGACCG CCGAGCTGAC 1140 
CGCGGTGTGC GACTTGCGCA GGATCCGCGC AGCGGAGGAA ATCGAACCGG AGGCGATGAC 1200 
1210 1220 1230 1240 1250 1260 

I N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i i i i I I I I I I I I I I I I i I i I I I I I I 

CTGGAGGAAC ATGTTCACGT GATTCAGGTT ATGAATAGGC ATCCCTTATT CCTTTTATTG 
GGTGGCGCGT GCCGCTTCCC TTGA 1284 
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Sequence: 34B12 ORF 1 (L-S) PROTEIN From: 1 To: 427 

10 20 30 40 50 

i i i i I i i i i I i i i i I i i i i 1 i i i i 1 i i ii I i i 1 1 I i i i i I i i i i I 1 1 i i I 

MARARPCVSI SVTRCSTASL TCASRATTRW WSVACRKVST AWPMPMARSW 
RWPIRAIDSW ACNSIPSRFS PPTASVCWRT LYSGAARWRS RSAFGPERRC 
AVSTEARLPG RRIVETLAAQ FAQALAGAFE KATEAALGFQ RGVWAIHLG 
VDAARFVQFA EDCQGTVAQA FLDEIMRRDA PSDAGFHQTQ HWEVFHEEY 
LVADRPQQVR MLPGAATEAD LPVIGQARDA VQGGIAQRVL RMGDDERLGV 
260 270 280 290 300 

I M I I I I I I I I I I I I I I I I 1 I I I I 1 I M I 1 I I I I I I I I I I I I I I I I I I I 1 

AEHALVEAGD LQFLVDGDGD IDFRWLLDR RQAIGGRGAY QADHVEIVEQ 
YAAHRIAERR RDGGVQQHPE IARTLVEIEG DVADQLLWQ QAAHVRDQAK 
RLLGGFDLVA VPTDQLHAQV DFQVADRRAD RGVRLAQDPR SGGNRTGGDD 
LEEHVHVIQV MNRHPLFLLL GGACRFP 427 
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Sequence: 34B12 ORF 2 From: 1 To: 1035 

10 20 30 40 50 -60 

i 1 1 1 1 i i i 1 1 1 1 i i I i i 1 1 I — i i i i I 1 1 1 1 1 i i i 1 1 i i 1 1 I i 1 1 i I i [ i i I 1 1 1 i I i i i i I 

ATGCCTATTC ATAACCT6AA TCACGTGAAC ATGTTCCTCC AGGTCATCGC CTCCGGTTCG 60 
ATTTCCTCCG CTGCGCGGAT CCTGCGCAAG TCGCACACCG CGGTCAGCTC GGCGGTCAGC 120 
AACCTGGAAA TCGACCTGTG CGTGGAGCTG GTCCGTCGGG ACGGCTACAA GGTCGAACCC 180 
ACCGAGCAGG CGCTTCGCCT GATCCCTTAC ATGCGCAGCC TGCTGAACTA CCAGCAGCTG 240 
ATCGGCGACA TCGCCTTCAA TCTCAACAAG GGTCCGCGCA ATCTCCGGGT GCTGCTGGAC 300 

310 320 330 340 350 360 

i i [ 1 1 1 1 1 i I 1 1 i i 1 1 1 1 i 1 — 1 1 1 1 I i 1 1 1 1 i 1 1 i I 1 1 1 i I 1 1 i i 1 1 i 1 1 l i i i 1 1 1 i i 1 1 

ACCGCCATCC CGCCGTCGTT CTGCGATACG GTGAGCAGCG TACTGCTCGA CGATTTCAAC 360 
ATGGTCAGCC TGATACGCAC CTCGCCCGCC GATAGCCTGG CGACGATCAA GCAGGACAAC 420 
GCGGAAATCG ATATCGCCAT CACCATCGAC GAGGAACTGA AGATCTCCCG CTTCAACCAG 480 
TGCGTGCTCG GCTACACCAA GGCGTTCGTC GTCGCCCATC CGCAGCACCC GTTGTGCAAT 540 
GCCTCCCTGC ACAGCATCGC GAGCCTGGCC AATTACCGGC AGATCAGCCT CGGCAGCCGC 600 
610 620 630 640 650 660 

I I I I I I I I I I I I I I I I I I I I I I I I ll M I I I I I I I I I I I I I l I I I I M I I I I I I I I I I I I 

TCCGGGCAGC ATTCGAACCT GCTGCGGCCG GTCAGCGACA AGGTGCTCTT CGTGGAAAAC 660 
TTCGACGACA TGCTGCGTCT GGTGGAAGCC GGCGTCGGAT GGGGCATCTC GCCGCATTAT 720 
TTCGTCGAGG AACGCCTGCG CAACGGTACC CTGGCAGTCC TCAGCGAACT GTACGAACCG 780 
GGCGGCATCG ACACCAAGGT GTATTGCTAC TACAACACCG CGCTGGAATC CGAGCGCAGC 840 
TTCCGTCGCT TTCTCGAAAG CGCCCGCCAG CGCCTGCGCG AACTGGGCCG CCAGCGTTTC 900 

910 920 930 940 950 960 

i i i i I i i i i I i i i i I i i i i I — i i i i I i 1 1 1 1 i i i i I i i i i I i i i i I i i i i I i i i i 1 i i [ i I 

GACGATGCGC CGGCCTGGCA ACCGAGCCTC GGTCGAAACT GCGCAGCGCC GCTCAGGCCC 960 
GAAGGCGCTC CGCGACCGCC AACGCGCCGC ACCAGAGTAG AGCGTTCTCC AGCAGACGCT 1020 
GGCCGTGGGT GGTGA 1035 
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Sequence: 34B12 ORF 2 PROTEIN From:: 1 To: 344 

10 20 30 40 50 60 

i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 

MPIHNLNHVN MFLQVIASGS ISSAARILRK SHTAVSSAVS NLEIDLCVEL VRRDGYKVEP 60 
TEQALRLIPY MRSLLNYQQL IGDIAFNLNK GPRNLRVLLD TAIPPSFCDT VSSVLLDDFN 120 
MVSLIRTSPA DSLATIKQDN AEIDIAITID EELKISRFNQ CVLGYTKAFV VAHPQHPLCN 180 
ASLHSIASLA NYRQISLGSR SGQHSNLLRP VSDKVLFVEN FDDMLRLVEA GVGWGIAPHY 240 
FVEERLRNGT LAVLSELYEP GGIDTKVYCY YNTALESERS FRRFLESARQ RLRELGRQRF 300 

310 320 330 340 350 360 

i i i 1 1 1 1 1 1 1 1 1 i 1 1 1 i 1 1 I 1 1 1 1 1 i 1 1 1 I i i ii I i i 1 1 1 i 1 1 i I 1 1 i 1 1 i i i i I i i 1 1 1 

DDAPAWQPSL GRNCAAPLRP EGAPRPPTRR TRVERSPADA GRGW 344 



Fig. 71 
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Sequence: 34B12 ORF 1 From:: 1 To: 759 

10 20 30 40 

i i i 1 1 1 i 1 1 1 i i 1 1 1 1 1 i 1 1 i i i 1 1 1 1 1 i I 1 1 1 i 1 1 1 i i 1 

ATGC66CGAG ATGCCCCATC CGACGCCGGC TTCCACCAGA 40 
CGCAGCATGT CGTCGAAGTT TTCCACGAAG AGCACCTTGT 80 
CGCTGACCGG CCGCAGCAGG TTCGAATGCT GCCCGGAGCG 120 
GCTGCCGAGG CTGATCTGCC GGTAATTGGC CAGGCTCGCG 160 
ATGCTGTGCA GGGAGGCATT GCACAACGGG TGCTGCGGAT 200 

210 220 230 240 

i i i i 1 1 i i i I 1 1 i 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 i I i 1 1 i 1 1 1 1 1 I 

GGGCGACGAC GAACGCCTTG GTGTAGCCGA GCACGCACTG 240 
GTTGAAGCGG GAGATCTTCA GTTCCTCGTC GATGGTGATG 280 
GCGATATCGA TTTCCGCGTT GTCCTGCTTG ATCGTCGCCA 320 
GGCTATCGGC GGGCGAGGTG CGTATCAGGC TGACCATGTT 360 
GAAATCGTCG AGCAGTACGC TGCTCACCGT ATCGCAGAAC 400 

410 420 430 440 
i i i i I i i i i I i i i i 1 i i i i I i i i i 1 i 1 1 i I i i i i 1 i i i i I 

GACGGCGGGA TGGCGGTGTC CAGCAGCACC CGGAGATTGC 440 
GCGGACCCTT GTTGAGATTG AAGGCGATGT CGCCGATCAG 480 
CTGCTGGTAG TTCAGCAGGC TGCGCATGTA AGGGATCAGG 520 
CGAAGCGCCT GCTCGGTGGG TTCGACCTTG TAGCCGTCCC 560 
GACGGACCAG CTCCACGCAC AGGTCGATTT CCAGGTTGCT 600 

610 620 630 640 

i i i i I i i i i I i i i i i i 1 1 i I 1 1 i i 1 1 1 1 i 1 i i i i 1 1 i i i I 

GACCGCCGAG CTGACCGCGG TGTGCGACTT GCGCAGGATC 640 
CGCGCAGCGG AGGAAATCGA ACCGGAGGCG ATGACCTGGA 680 
GGAACATGTT CACGTGATTC AGGTTATGAA TAGGCATCCC 720 
TTATTCCTTT TATTGGGTGG CGCGTGCCGC TTCCCTTGA 759 
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Sequence: 34B12 ORF 1 S PROTEIN From:: 1 To: 253 

10 20 30 40 

i i 1 1 1 1 1 1 i I 1 1 1 1 1 1 i i i I 1 1 i 1 1 i 1 1 i I i i i i I i 1 1 i I 

MRRDAPSDAG FHQTQHWEV FHEEHLVADR PQQVRMLPGA 40 
AAEADLPVIG QARDAVQGGI AQRVLRMGDD ERLGVAEHAL 80 
VEAGDLQFLV DGDGDIDFRV VLLDRRQAIG GRGAYQADHV 120 
EIVEQYAAHR IAERRRDGGV QQHPEIARTL VEIEGDVADQ 160 
LLWQQAAHV RDQAKRLLGG FDLVAVPTDQ LHAQVDFQVA 200 

210 220 230 240 

i i i i I i i i i I i i i i I i i i i I i i i i I i ii i I i i i i I i i i i I 

DRRADRGVRL AQDPRSGGNR TGGDDLEEHV HVIQVMNRHP 240 
LFLLLGGACR FP. 253 
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pho34B12 0RF1 (L-S) SEQ ID NO:107 

1 MARARPCVSI SVTRCSTASL TCASRATTRW WSVACRKVST AWPMPMARSW 

51 RWPIRAIDSW ACNSIPSRFS PPTASVCWRT LYSGAARWRS RSAFGPERRC 

101 AVSTEARLPG RRIVETLAAQ FAQALAGAFE KATEAALGFQ RGVWAIHLG 

151 VDAARFVQFA EDCQGTVAQA FLDEIMRRDA PSDAGFHQTQ HWEVFHEEY 

201 LVADRPQQVR MLPGAATEAD LPVIGQARDA VQGGIAQRVL RMGDDERLGV 

251 AEHALVEAGD LQFLVDGDGD IDFRWLLDR RQAIGGRGAY QADHVEIVEQ 

301 YAAHRIAERR RDGGVQQHPE IARTLVEIEG DVADQLLWQ QAAHVRDQAK 

351 RLLGGFDLVA VPTDQLHAQV DFQVADRRAD RGVRLAQDPR SGGNRTGGDD 

401 LEEHVHVIQV MNRHPLFLLL GGACRFP* 

Fig. 8 

phoB12 ORF2 SEQ ID NO: 108 

1 MPIHNLNHVN MFLQVIASGS ISSAARILRK 

51 VRRDGYKVEP TEQALRLIPY MRSLLNYQQL 

101 TAIPPSFCDT VSSVLLDDFN MVSLIRTSPA 

151 EELKISRFNQ CVLGYTKAFV VAHPQHPLCN 

201 SGQHSNLLRP VSDKVLFVEN FDDMLRLVEA 

251 LAVLSELYEP GGIDTKVYCY YNTALESERS 

301 DDAPAWQPSL GRNCAAPLRP EGAPRPPTRR 

Fig. 9 



36A4 SEQ ID NO: 109 

1 AAGGGTTTTG GCGGGGTCAT CCGAGTGACC CTGAGCATGC TCCTGGCGAT 
51 CTTCTTGTCG GTGCTGCTGG CGCCGGTGCG CATGCTGTTC CACACCCGCT 
101 TCGTGCTGGC CGCCTTCCTC GGCTGGTC 

Fig. 10 

36A4 SEQ ID NO:110 

1 KGFGGVIRVT LSMLLAIFLS VLLAPVRMLF HTRFVLAAFL GW 



SHTAVSSAVS NLEIDLCVEL 
IGDIAFNLNK GPRNLRVLLD 
DSLATIKQDN AEIDIAITID 
ASLHSIASLA NYRQISLGSR 
GVGWGIAPHY FVEERLRNGT 
FRRFLESARQ RLRELGRQRF 
TRVERSPADA GRGW* 



Fig. 11 
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contig 2507 SEQ ID NO: 111 

1 CTACTGGGGC AAGCTGAAGA CGCCGTTCAA GCTGAGCTTC TATCACCAGG 
51 GCATGCACTT CGACACGCCG GTGAAGATCA ACGAGGTGAC CGCTACCACG 
101 GTCAAGCCGA TCAAGTACGA TCGCACCAAG TTCGATTTCG GATCCCTGAA 
151 GTTCGACGAG AATGCCACCA AGGATCTCGG CTATGCCGGT TTCCGCGTGC 
201 TCTATCCGAT CAACAAGGCC GACAAGCAGG ACGAGATCGC CACCTTCCTT 
251 GGCGCGAGCT ACTTCCGCGT GGTCGGCAAG GGCCAGGTCT ACGGTCTGTC 
301 GGCGCGCGGC CTGGCGATCG ATACCGCGCT GCCTTCGGGC GAAGAGTTCC 
351 CGCGCTTCCG CGAATTCTGG ATCGAGCGGC CGAAGCGCAG GACAAGCAAC 
401 TGGTGATCTA CGCCCTGCTC GACTCGCCGC GGGCCACCGG CGCCTACCGC 
451 TTCGTGCTGC GTCCGGGCAA GGATGCGGTG ATGGATGTCC AGGCCCGCGT 
501 GTTCCTCCGC GACAAGGTCA GCAAGCTGGG CCTGGCGCCG CTGACCAGCA 
551 TGTACCTGTT CGGCTCCAAC CAGCCGTCCG AGCAGCACAA CTTCCGGCCC 
601 GAGCTGCATG ACTCCAGCGG CCTGCAGATC CATGCCGGCA ACGGCGAGTG 
651 GCTGTGGCGT CCGCTGAACA ATCCGAAGCA CCTGTCGGTG AGCACCTTCA 
701 GCGTGGAGAA CCCGAAAGGC TTCGGCCTGC TCCAGCGCGG CCGCGAGTTC 
751 TCCCGCTACG AAGACCTGGA TGACCGCTAC GACCTGCGTC CGAGTGCCTG 
801 GATCGAGCCG AAGGGCGACT GGGGCAAGGG CACCGTGGAA CTGGTGGAAA 
851 TCCCGACCCC GGACGAAACC AACGACAATA TCGTCGCGTT CTGGAACCCC 
901 GAGACCCAGC CTGAGGTCGG AAAGCCGCTG GACTTCGCCT ACCGCCTGCA 
951 CTGGACCATG GATGAAGACG AGCTGCACGA CCCGAAATCC TCCTGGGTCA 
1001 AGCAGACCAT GCGCTCGGTC GGCGACGTGA AGCAGAAGAA CCTGATCCGC 
1051 CAGCAGGACG GCAGCACCGC CCTGGTCGTC GACTTCGAAG GGCCGGCCCT 
1101 GAAGGACCTG GCGCCGGACG CGCCGGTGAC CACCCAGGTC AGCACCGACA 
1151 GCAACGCCGA GGTGGTGGAG AACAGCCTGC GTTACAACCC GGTCCTGAAA 
1201 GGCTGGCGCC TGACGCTGCG GATCAAGGTC AAGGATCCGA AGAAGCCGGT 
1251 GGAAATGCGC GCGGCGCTGG TCGACGAGGC GCAGAAGCCA CTGAGCGAAA 
1301 CCTGGAGCTA TCAGCTGCCT GCCGATGAAT AACCCATCCA CTACGAAAGC 
1351 ACCGCTGGCC GACTACCTCG CTCATCTTCC CCTGGCGGAA GAGGAGCGGG 
1401 AGCGCCTTGG CGAGTCCGCT TCCTTCTCCG AGCTGCACGC TCGCCTGGCG 
1451 GGAGCGGAAG GCGCCGCTGC CGATGCCGGG GGCGATCCCG CCCTGGCCTC 
1501 GGTACGCGCC CGCCTGCAGC TGGGCACCCC TGAGCTGGAC GACGCCGAGA 
1551 TGTTCGGCGT CGACGCCCAG GGTCGCACCT TCCTCAAGAT TTCCCCGCCG 
1601 ATCCGCCGTA CCAAGGTGAT TCCCGAGCCC TGGCGCACCA ACATCCTGGT 
1651 GCGCGGCTGG CGTCGGCTGA CCGGACGCAG CAACCCGCCC AAGCCCAAGC 
1701 GTGCCCTGCC GCGGGCCCGC TGGCAGCGGG TCGGCTCGCT GCGCCGGTTC 
1751 ATCCTGCTGT TGTTGATGCT GGCGCAGACC TCGGTCGCCA CCTACTACAT 
1801 GAAAGGCATC CTGCCCTACC AGGGCTGGGC CTTCGTCGAC CTGGAGGAGC 
1851 TGGCCCAGCA GAGCCTGCTG GATACCGTCC AGCAGGTGCT GCCCTATGTC 
1901 ATCCAGTTCG GCATCCTGGC GCTCTTCGCG ATCCTCTTCT GCTGGGTCTC 
1951 GGCCGGCTTC TGGACCGCGC TGATGGGCTT CTGGGAGCTG CTCACCGGGC 
2001 GTGACCGCTA CCGGATCTCC GGCAGCAGCG CCGGCAGCGA GCCGATCGCC 
2051 GCCGACGCCC GCACGGCGAT CGTCATGCCG ATCTGCAACG AAGACGTGCC 
2101 GCGGGTATTC GCCGGCCTGC GGGCGACCGT CGAGTCGATG GCCGCCACCG 
2151 GCGAGATGGA GCGCTTCGAC TTCTTCGTCC TCAGCGACAC CAACGACCCG 
2201 GATATCGCCG TCGCCGAGCA GCAGGCCTGG CTCGAGCTGT GCCGCGAGAC 
2251 CAAGGGCTTC GGCAAGATCT TCTACCGTCG CCGCCGGCGC CGGGTGAAGC 
2301 GCAAGAGCGG CAACATCGAC GACTTCTGCC GGCGCTGGGG CGGCGACTAC 
2351 CGCTACATGG TGGTGATGGA CGCCGACAGC GTGATGAGCG GCGACTGCCT 
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2401 GGCCAAGCTG GTACGCCTGA TGGAGGCCAA TCCTGAGGCG GGGATCATCC 
2451 AGACCGCGCC GAAGGCTCCG GCATGGACAC CCTGTATGCG CGCATGCAGC 
2501 AGTTCGCCAC CCGCGTCTAC GGCCCGCTGT TCACCGCCGG CCTGCACTTC 
2551 TGGCAACTCG GCGAGTCGCA CTACTGGGGC CACAACGCGA TCATCCGCAT 
2601 GCAGCCCTTC ATCGACCACT GCGCCCTGGC GCCGTTGCCG GGCAAGGGCT 
2651 CGTTCGCCGG CGCGATCCTG TCCCACGACT TCGTCGAGGC TGCGTTGATG 
2701 CGCCTTGCCG GCTGGGGCGT GTGGATCGCC TACGACTTCG ACGGCAGCTA 
2751 CGAAGAACTG CCGCCGAACC TGCTCGACGA ACTCAAGCGC GACCGCCGCT 
2801 GGTGCCACGG CAACCTGATG AACTTCCGCC TGTTCCTGGT CAAGGGCATG 
2851 CACCCGGTGC ACCGCGCGGT GTTCCTCACC GGGGTCATGT CCTACCTGTC 
2901 GGCGCCGTTG TGGTTCTTCT TCCTGGTGCT TTCCACCGCG CTGCTGGCGG 
2951 TGCACCAACT GATGGAGCCG CAGTACTTCC TGGAACCGCG GCAGCTGTTC 
3001 CCGATCTGGC CGCAGTGGCA TCCGGAGAAG GCCATCGCGT TGTTCTCCAC 
3051 CACCCTGACC CTGTTGTTCC TGCCCAAGCT GCTCAGCGTA ATGCTGATCT 
3101 GGGCCAAGGG CGCCAAGGGT TTCGGCGGGG TGATCCGGGT GACCCTGAGC 
3151 ATGCTCCTGG AGATGTTCTT CTCGGTGCTG CTGGCGCCGG TGCGCATGCT 
3201 CTTCCACACC CGCTTCGTGC TGGCCGCCTT CCTCGGCTGG TCGGTGCAGT 
3251 GGAACTCGCC GCAGCGCGAC GACGACGCCA CGCCCTGGAG CGAGGCGATC 
3301 CGCCGGCACG CAATGCAGAC CCTGCTGGGT ATCGCCTGGA CCCTGCTGGT 
3351 GGCCTGGCTC AACCCGCGCT TCCTGTGGTG GCTGTCGCCG ATCGTCGGTT 
3401 CGCTGATCCT GTCGATCCCG GTATCGGTGA TCTCCAGCCG GGTGAAGCTG 
3451 GGCCTGCGGG CCCGCTACGA AAAGCTGGTC CTGATCCCGG AGAGTACGAC 
3501 ACGCCGCGCG ACTGCGCGCC ACCGACGAGT ACACCTACGA GAACCGCTGG 
3551 CATGCGCTCA AGGATGGCTT CCTCAAGGCC GCCGTCGATC CGTTGCTCAA 
3601 CGCCCTGGCC TGCGCCATGG GCACGGCTCG CCACAACCGT GCGCAGGCCA 
3651 TCGAGACGGT GCGTGGCGAG CGTATCGGCA AGGCCATCGA TAAGGGCCCG 
3701 GAACAGCTCG ACGGCGCCAC GCGCCTGGCT CTGTTGAGTG ACCCGGTAGC 
3751 ACTTTCGCGC CTGCATACGC GGGTCTGGGA AGAGGACCGC GACGACTGGC 
3801 TCGGCCGCTG GCGCAAGGCC GAGGCGGACG ACCCCCACGC CGCCAGCGTA 
3851 CCGCTGGCCC AGGTAGTGCC CGGCGACGCC GGCCTGCTGC CCGCCGCCCA 
3901 GTCCTGATCC CATGCCCCCG GCGGAACGCC GCCGGGGGCA TGGGTCTGTT 
4001 TCTTGCCTGT TTTCCCCGTG CGGCGCTGCT GTTACCCTGC GCCGGCAATC 
4051 CAGAAAGTCT CGTATCGTTC GCCAGCTGAG GTACTATCGG CCGCCTTTTG 
4101 CGCAGCCGGT CATGGCCTGC TGCCCGCCCG GGACGGCGAC ACGACGAGAG 
4151 CATCCGTTCG ACGACTGTGT TTCTAAGACT GCTGGGGATT GGGGAATGAA 
4201 AAAGTATCTT GCTTCATTGG TTCTGGGCGT CTGCGCCCTG GTGGGCGTGG 
4251 CTTCGGTCCA GGCGGCCGGC GCGGTGGAGG ACGCGGTCAA GCGCGGCACC 
4301 CTGCGGGTCG GCATGGACCC GACCTACATG CCGTTCGAGA TGACCAACAA 
4351 GCGTGGCCAG ATCATCGGCT TCGAAGTCGA CCTGCTCAAG GCCATGGCCA 
4401 AGTCCATGGG CGTCAAGCTG GAGCTGGTCT CCACCAGCTA CGACGGCATC 
4451 ATCCCGGCGC TGCTGACCGA CAAGTTCGAC ATGATCGGCT CGGGCATGAC 
4501 CCTGACCCAG GAGCGCAACC TGCGCCTGAA CTTCTCCGAG CCCTTCATCG 
4551 TGGTCGGCCA GACCCTGCTG GTGCGCAAGG AACTGGAAGG CAAGATCAAG 
4601 TCCTACAAGG ACCTGAACGA TCCGCAGTAC AGCATCACCT CGAAGATCGG 
4651 CACCACCGGT GAGATCGTTG CCCGCAAGCT GATCAGCAAG GCCAAGTACC 
4701 ACGGCTTCGA CAACGAGCCG GAAGCGGTGA TGGACGTGGT CAACGGCAAG 
4751 GCCGACGCCT TCATCTACGA CTCGCCCTAC AACGTGGTGG CGGTGAGCAA 
4801 GTTCGGCGCC GGCAAGCTGG TCTACCTCGA CCAGCCGTTC ACCTACGAGC 
4851 CGCTGGCGTT CGGCCTGAAG AAAGGCGACT ACGACAGCAT CAATTTCATC 
4901 AACAACTTCC TCCATCAGAT CCGCGAAGAC GGCACCTATC AGCGCATCCA 

Fig. 12B 
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4951 CGACAAGT66 TTCAAGAACA 
5001 CTGACGGCCC CCGCGAAGGG 
5051 AGAGAGTTTC CGTGACCAAG 
5101 CTGACCGGGC TGATCCTGCT 
5151 CTCGCTGATT TCCTATGAAA 
5201 CTACCAGGCC GAGGAGCCGT 
5251 CTATCGAGGA ACAGGGCAGG 
5301 GCGAGAAGCA GGTCGTGACC 
5351 GCGACGACGT GGCCGAGGGC 
5401 GCCGGCACTG CTCTGGGGCC 
5451 CCGGTGCCAT CGGTCTGGCT 
5501 TCGAAGAACC CGACCCTGCA 
5551 GCGCGGCACG CCGTTGCTGG 
5601 GCACCGTGCT CAACCTGTCC 
5651 CTGTTCACCG GCGCCTACGT 
5701 CATCGCCCGC GGACAGAACG 
5751 GCCAGTCGAT GCGCTACGTG 
5801 CGCCGCTGGC CGGGCAGTTC 
5851 TCGGTGATCG CCATCACCGA 
5901 CCACTTCGTT CTCCAACTTC 
5951 CCTGCTGTTG AACCTGCCCC 
6001 GGCTCGGACA AAGTGATTGA 
6051 CCGCGGCCAG GTAGTGCGCG 
6101 GGGGCGAGGT ACTGGTGGTG 
6151 TTCCTGCGCT GCCTGAACGG 
6201 CATCGACGGC GTCGACCTGG 
6251 GCCGCGAAGT CGGCATGGTG 
6301 ACCGTGCTCG AGAACCTCTG 
6351 CAAGGCCGAG CGCGAGGCCA 
6401 TCGGGCAGAA GGCCGACGAA 
6451 CAGCGCGTGG CGATCGCTCG 
6501 GTTCGACGAA CCGACCTCGG 
6551 TCGACGTCAT GAAGACCCTG 
6601 ACCCACGAGA TGGGCTTTGC 
6651 CGACCACGGC AAGCTGCTGG 
6701 ATCCGCAGGA CCCGCGGGCC 
6751 CGCGCTAGGC GAACGGCTTG 
6801 TGCCGCGCGG CCGGCTGGAT 
6851 TTCGCGCAGG CGCTCGTTCT 
6901 GGTACTCCAT CAGAGAGTGC 
6951 AGCGCGTTG 



CCGAGTGGCT GAAGGAAATG GAATGAACCG 
GGCCGTCGTA CCTGCGCATT CCATCGTTCG 
AAGAAACGTT CCGTCTGGCC CTGGCACCTG 
GGTCATGGCC TGGGCGCTGT GGTTCTCCAC 
TGGCGTGGGA CCGCGTTTCC GAGTACTTCG 
TACGGGCCAA CGAGATCGGC CGGGTCGAGG 
GACGCGCGCG TCACGCTGCT TGGCGAGACG 
GTTGCCCAGG ACAGCCTGCA ATTCTCCGAA 
GACGCGGTCG GGGTGACCCG CCACTGGGCC 
TGTGGACCAC CCTCTGGCTA TCGCTGGTGT 
ATCGGCCTGG TCGCCGGCCT CTGCCGGCTG 
CGACCTGTCG ACGATCTACG TCGAGCTGGT 
TGCAGATCTT CATCTTCTAC TTCTTCATCG 
CGCGAGTTCG CCGGGGTTGC GGCGCTGGCG 
GGCCGAGATC ATCCGGGCCG GCGTGCAGTC 
AGGCCGCCCG CTCCCTGGGC CTGAACGCCG 
ATCCTGCCGC AGGCTTCAAG CGCGTGCTGC 
ATCAGCCTGG TCAAGGACAC CTCGCTGGTC 
ACTGACCAAG AGCGGCCGCG AGGCGATCAC 
GAGATCTGGT TTCTGCGTCG CCGCGTTGTA 
TTTCGCACAT GGCATCCCGA CTGGAGCGGA 
AGTACGCAAC CTGCTGAAGG TCTTCGATAC 
CGGTGGACGA CGTGAGTACC CGCGTGGCCA 
ATCGGTCCGT CCGGTTCCGG CAAGTCGACC 
CCTGGAGGAG TTCGACGAAG GCTCGGTGAG 
CCGACCCGAG GACCGACATC AATGCCTACC 
TTCCAGCATT TCAACCTGTT CCCGCACATG 
CCTGGCCCAA CGCGTGGTGC GCAAGCGCGG 
AGGCGCGGGC GCTGCTGGCC AAGGTCGGCA 
TATCCCTCGC GCCTGTCCGG CGGCCAGCAG 
CGCGTTGTGC ATGGACCCCA AGGTGATGCT 
CGCTCGATCC GGAGATGGTC GGCGAAGTCC 
GCCGTGGAAG GCATGACCAT GGTCTGCGTG 
CCGCGAAGTG GCCGACCGCG TGCTGTTCTT 
AGGACGCGCC GCCGGCGCAG TTCTTCGACA 
CAGGCCTTCC TCCGCCAGGT CCTCTAGTAC 
CCCGGCGGCG GCAGGAGCGA CGTCGGACTC 
ATCGTTGTCC TCCAGCCAGT CCAGCGCCCA 
GGTAGCGGTA CCAGTCCTGC AACAGTTCCG 
TTGAAGGCCT TGAACGGCTT GCGGCTCTGC 
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23A2 DNA SEQ ID NO:H2 

1 CGAG6TTTCC GTCTACGAAG GCACCGGCTC GGTCACCATC CGCGCCGTGT 
51 TCCCCAACCC GAACAACGAG CTGCTCCCCG GCATGTTCGT TCACGCGCAG 
101 TTGCAGG 

Fig. 13 

23A2 peptide SEQ ID NO:113 
1 EVSVYEGTGS VTIRAVFPNP NNELLPGMFV HAQLQ 



Fig. 14A 
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SEQ ID NO:148 
DNA flanking the 23A2 locus. 
mexA partial sequence, mexB partial sequence 

1 ggccaggcaa acgcgatggc caccgtgcaa cagctcgacc cgatctacgt cgacgtcacc 
61 cagccgtcca ccgccctgtt gcgcatgcgc cgcgaactgg ccagcggcca gttggagcgc 
121 gccggcgaca acgctgcgaa ggtctccctg aagctggagg acggtagcca atacccgctg 
181 gaaggccgcc tcgaattctc cgaggtttcc gtcgacgaag gcaccggctc ggtcaccatc 
241 cgcgccgtgt tccccaaccc gaacaacgag ctgctgcccg gcatgttcgt tcacgcgcag 
301 ttgcaggaag gcgtcaagca gaaggccatc ctcgctccgc agcaaggcgt gacccgcgac 
361 ctcaagggcc aggctaccgc gctggtggtg aacgcgcaga acaaggtcga gctgcgggtg 
421 atcaaggccg accgggtgat cggcgacaag tggctggtca ccgaaggcct gaacgccggc 
481 gacaagatca ttaccgaagg cctgcagttc gtgcagccgg gtgtcgaggt gaagaccgtg 
541 ccggcgaaga atgtcgcgtc cgcgcagaag gccgacgccg ctccggcgaa aaccgacagc 
601 aagggctgat caaggggatt cgtaatgtcg aagtttttca ttgataggcc cattttcgcg 
661 tgggtgatcg ccttggtgat catgctcgcg ggcggcctgt cgatcctcaa tctgccggtc 
721 aaccagtacc cggccatcgc cccgccggcc atcgccgtgc aggtgagcta cccgggcgcc 
781 tcggccgaga cggtgcagga caccgtggtc caggtgatcg agcagcagat gaacgggatc 
841 gacaatctgc gctacatctc ctcggagagt aactccgacg gcagcatgac catcaccgtg 
901 accttcgaac agggcaccga ccccgacatc gcccaggtcc aggtgcagaa caagctgcaa 
961 ctggccaccc cgctgctgcc gcaggaagtg cagcgccagg ggatccgg 

Fig. 14B 

SEQ ID NO:149 
PA14 mexA 

G QANAMATVQ QLDPIYVDVT QPSTALLRMR RELASGQLER AGDNAAKVSL KLEDGSQYP LEGRLEFSE 
VSVDEGTGS VTIRAVFPN PNNELLPGM FVHAQLQEG VKQKAILAP QQGVTRDLK GQATALWN 
AQNKVELRV IKADRVIGD KWLVTEGLN AGDKIITEG LQFVQPGVE VKTVPAKNV ASAQKADAA PAKTDSKG 

Fig. 14C 

SEQ ID NO:150 
PA14 mexB 

MSKFFIDRP I FAWVI ALVIMLAGGLS ILNLPVNQYPAIAPPAI A 

VQVSYPGASAETVQDTWQVIEQQMNGIDNLRYISSESNSDGSMTITVTFEQGTDPDI 

AQVQVQNKLQLATPLLPQEVQRQGIR 



Fig. 14D 
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PAOl Phenelzine operon SEQ ID N0:114 

1 GCAAGCTCAA CTCCAGCAAC AAGGCGGAGG CCACCATGAA GGCTTACGCC 
51 ATCGGCCTGC TCAACTGAAT CGACGCCTCG TCGCCTAGCG AGGCCGCCGC 
101 GCAAGCGTCC GGCCATTCAC CGAATGGCCG GATAGCGTTT GCGCCGGTCG 
151 CCTGAGCGCA CGCTTCCCAC CGGCAGCGTT TCCCCGCTGC CCCCTTCGCC 
201 ATTGCGCCCG TCCTCATGTT GTCCGGACGC TAGTCGAACT TTCCGGGCGC 
251 CTGGCAAACC GGCCAAAGAA TAGAACGGAA TCGATGCCCA CACCTTTAAT 
301 TTTTAAGGGT TTTTCCTTTT CAAAAACCGT TATTAAGTTT TCCCCTTTAA 
351 ATCTTGGTAC AACTGGGTTC AGGCGAAACT TCGGTCATGC CATTCGGCAT 
401 TAGTTAAACT TTGAGACTCT CCAAGCGGGA ATTTTTGCCG GAACAGCTTC 
451 ACGGCATTTC TCCGCTTTCA TCCCGATGTT TCTTTCCGTT ATGATTCCAG 
501 TCGATTCGAA CTGCCGGAGT TCCCACCATT CGAGATTACC AACGTTGAAA 
551 AGGGTTTACC GACAACCTGG AATTGCGTCG GCGCAACCGT GCCACGGTCG 
601 AGCACTACAT GCGCATGAAG GGGGCCGAAC GGTTACAGCG GCACAGCCTG 
651 TTCGTCGAGG AGGCTGCGCC GGCAACTGGA CCACGGAAAG CGGCGAACCC 
701 CTGGTTTTCC GGGGCCATGA GAGCCTCAGG CGGCTCGCCG AGTGGCTCGA 
751 GCGCTGCTTC CCCGACTGGG AGTGGCACAA CGTGCGGATC TTCGAGACCG 
801 AGGATCCGAA CCACTTCTGG GTCGAGTGCG ACGGGCGCGG CAAGGCGCTG 
851 GTCCCGGGGT ATCCGCAGGG CTATTGCGAG AACCACTACA TCCATTCCTT 
901 CGAACTCGAG AACGGCCGGA TAAAACGCAA TCGCGAGTTC ATGAACCCGA 
1001 TGCAGAAATT GCGTGCATTG GGAATAGCCG TTCCACAAAT AAAACGTGAC 
1051 GGTATTCCCA CCTGATTAAT GTCTATTCCA ATTCAAGAGG AGATATGACG 
1101 ATGCTCGATA ATGCCATTCC TCAAGGTTTC GAAGACGCCG TGGAGTTGCG 
1151 CAGGAAGAAT CGCGAGACGG TGGTCAAGTA TATGAACACC AAAGGCCAGG 
1201 ATCGCCTGCG CCGCCATGAA CTTTTCGTCG AGGACGGCTG TGGCGGTTTA 
1251 TGGACCACCG ATACCGGCTC GCCCATCGTC ATTCGTGGCA AGGACAAGCT 
1301 GGCCGAGCAC GCGGTGTGGT CGCTGAAATG CTTCCCGGAT TGGGAGTGGT 
1351 ACAACATCAA GGTCTTCGAG ACCGACGATC CCAACCACTT CTGGGTCGAG 
1401 TGCGACGGCC ACGGCAAGAT CCTCTTCCCC GGCTATCCCG AGGGCTACTA 
1451 CGAGAACCAC TTCCTGCATT CCTTCGAGCT GGACGACGGC AAGATCAAGC 
1501 GCAACCGCGA ATTCATGAAC GTCTTCCAGC AATTGCGCGC CCTGAGCATT 
1551 CCGGTCCCGC AGATCAAACG CGAAGGCATT CCCACCTGAG GCCATCCTGG 
1601 AAGGGGTGAA CTATGGACGA TCTATTGCAA CGCGTACGGC GCTGCGAAGC 
1651 GCTGCAGCAA CCCGAATGGG GCGATCCGTC GCGCCTGCGC GACGTGCAGG 
1701 CGTACCTGCG CGGCAGTCCG GCGCTGATCC GCGCCGGCGA CATCCTGGCC 
1751 CTGCGCGCGA CCCTGGCGCG GGTCGCCCGC GGCGAGGCGC TGGTGGTGCA 
1801 GTGCGGCGAC TGCGCCGAGG ACATGGACGA CCACCATGCC GAGAACGTGG 
1851 CGCGCAAGGC CGCCGTGCTG GAACTGCTGG CCGGCGCCCT GCGCCTGGCC 
1901 GGCCGGCGGC CGATAGATCC GCGTCGGGCG CATCGCCGGG CAGTACGCCA 
1951 AGCCGCGTTC CAAGCCGCAC GAGCAGGTCG GCGAGCAGAC CCTGCCGGTC 
2001 TATCGCGGCG ACATGGTCAA CGGCCGCGAG GCCCATGCCG AACAGCGCCG 
2051 GGCCGATCCG CAGCGGATCC TCAAGGGCTA TGCGGCGGCG CGCAACATCA 
2101 T 



Fig. 15 
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3E8 sequence SEQ ID NO:115 



1 C6GC6CCGAG GATCCGCTGT TCGAGTTAGG CGCAAGCGTC CGGCCATTCA 
51 CGGAATGGCC GGATAGCGTT TGCGCCGGTT GCTTGAGCGC AGCTTCCCAC 
101 CGGCAGGGTT TCCCCGCTGC CCCTTTCGCC ATTGCGCCGT CCTCTTGTTG 
151 TCCGGCACGC TAGTGCAACT TTCCGGACGC TTGGCAAACC GGCCAAAGAA 
201 TAGAACGGAA TCGATGCCCC ACACCTGTAA TTTTTAAGGG GTTATGGCTA 
251 TTGCAAAAAA GCGTTTATAA GTTTGTCCCC TGTCAAATCT GGTTACAACT 
301 GGGTTTCAGG CGAAACATTC GGTCATGGCA ATTCGGCATT AGTTGAAACT 
351 TTGGAGACGC TCCGAAGCGG GCAACTTTTG CCCGGAAAAA GTTTCACGGC 
401 AATTTTTCCG GCCTGTCATC CCGATGTCTT CTTTCCAGTA TGGATGCCAG 
451 TCGATTCGAA CTGGCGGAGA TTCGCACCAT GCGAGAGTAC CAACGGTTGA 
501 AAGGGTTTAC CGACAACCTG GAATTGCGGC GGCGCAACCG TGCCACGGTC 
551 GAGCACTACA TGCGCATGAA GGGGGCCGAA CGGTTGCAGC GGCACAGCCT 
601 GTTCGTCGAG GACGGCTGCG CCGGCAACTG GACCACGGAA AGCGGCGAAC 
651 CCCTGGTTTT CCGGGGCCAT GAGAGCCTCA GGCGGCTCGC CGAGTGGCTC 
701 GAGCGCTGCT TCCCCGACTG GGAGTGGCAC AACGTGCGGA TCTTCGAGAC 
751 CGAGGATCCG AACCACCTCT GGGTCGAGTG CGACGGGCGC GGCAAGGCGC 
801 TGGTCCCGGG GTATCCGCAG GGCTATTGCG AGAACCACTA CATCCATTCC 
851 TTCGAACTCG AGAACGGCCG GATAAAACGC AATCGCGAGT TCACGAACCC 
901 GATGCAGAAA TTGCGTGCAT TGGGAATAGC CGTTCCGCAA ATAAaACGTG 
951 ACGGCATTCC CACCTGATTA ATGATTATTC CAATTCAAGA GGAGATATGA 
1001 CGATGCTCGA TAATGCTATT CCCCAAGGTT TCGAAGACGC CGTGGAGTTG 
1051 CGCAGGAAGA ATCGCGAGAC GGTGGTCAAG TATATGAACA CCAAAGGCCA 
1101 GGATCGCCTG CGCCGCCATG AACTTTTCGT CGAGGACGGC TGTGGCGGTT 
1151 TATGGACCAC CGATACCGGC TCGCCCATCG TCATTCGTGG CAAGGACAAG 
1201 CTGGCCGAGC ACGCGGTGTG GTCGCTGAAA TGCCTTCCCG GATTGGGAGT 
1251 GGTACAACAT CAAGGT 



Fig. 16A 



105/133 



3E8 SEQUENCE TAG SEQ ID NO:160 

1 tatggatgcc agtcgattcg aactggcgga gattcgcacc atgcgagagt accaacggtt 
61 gaaagggttt accgacaacc tggaattgcg gcggcgcaac cgtgccacgg tcgagcacta 
121 catgcgcatg aagggggccg aacggttgca gcggcacagc ctgttcgtcg aggacggctg 
181 cgccggcaac tggaccacgg aaagcggcga acccctggtt ttccggggcc atgagagcct 
241 caggcggctc gccgagtggc tcgagcgctg cttccccgac tgggagtggc acaacgtgcg 
301 gatcttcgag accgaggatc cgaaccacct ctgggtcgag tgcgacgggc gcggcaaggc 
361 gctggtcccg gggtatccgc agggctattg cgagaaccac tacatccatt ccttcgaact 
421 cgagaacggc cggataaaac gcaatcgcga gttcacgaac ccgatgcaga aattgcgtgc 
481 attgggaata gccgttccgc aaataaaacg tgacggcatt cccacctgat taatgattat 
541 tccaattcaa gaggagatat gacgatgctc gataatgcta ttccccaagg tttcgaagac 
601 gccgtggagt tgcgcaggaa gaatcgcgag acggtggtca agtatatgaa caccaaaggc 
661 caggatcgcc tgcgccgcca tgaacttttc gtcgaggacg gctgtggcgg tttatggacc 
721 accgataccg gctcgcccat cgtcattcgt ggcaaggaca agctggccga gcacgcggtg 
781 tggtcgctga aatgcttccc ggattgggag tggtacaaca tcaaggtctt cgagaccgac 
841 gatcccaacc acttctgggt cgagtgcgac ggccacggca agatcctctt ccccgggtat 
901 cccgagggtt actacgagaa ccacttcctg cattccttcg agctggacga cggcaagatc 
961 aagcgcaacc gcgaattcat gaacgtcttc cagcaattgc gcgccctgag cattccggtc 
1021 ccgcagatca aacgcgaagg cattcccacc tgaggccatc ctggaagggg tgaactatgg 
1081 acgatctatt gcaacgcgta cggcgctgcg aagcgctgca gcaacccgaa tggggcgatc 
1141 cgtcgcgcct gcgcgacgtg caggcgtacc tgcgcggcag tccggcgctg atccgcgccg 
1201 gcgacatcct ggccctgcgc gcgaccctgg ccgggtcgcc cgcggcgagg cgctggtggt 
1261 gcagtgcggc gactgcgccg aggacatgga cgaccacca 



Fig. 16B 
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3E8phzA SEQIDNO:116 

1 MREYQRLKGF TDNLELRRRG SAVRVRRKRP AIHGMAGZRL RRLLERSFPP 
51 AGFPRCPFRH CAVLLLSGTL VQLSGRLANR PKNRTESMPH TCNFZGVMAI 
101 AKKRLZVCPL SNLVTTGFQA KHSVMAIRHZ LKLWRRSEAG NFCPEKVSRQ 
151 FFRPVIPMSS FQYGCQSIRT GGDSHHARVP TVERFTDNLE LRRRNRATVE 
201 HYMRMKGAER LQRHSLFVED GCAGNWTTES GEPLVFRGHE SLRRLAEWLE 
251 RCFPDWEWHN VRIFETEDPN HLWVECDGRG KALVPGYPQG YCENHYIHSF 
301 ELENGRIKRN REFTNPMQKL RALGIAVPQI KRDGIPTZLM IIPIQEEIZR 
351 CSIMLFPKVS KTPWSCAGRI ARRWSSIZTP KARIACAAMN FSSRTAVAVY 
401 GPPIPARPSS FVARTSWPST RCGRZNAFPD WEWYNIK 



Fig. 17 
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3E8 phzB SEQ ID NO:117 

1 MLDNAIPQGF EDAVELRRKN RETWKYMNT KGQDRLRRHE LPVED6CGGL 
51 WTTDTGSPIV IRGKDKLAEH AVWSLKCLPG LGWQHQG 



Fig. 18A 



3E8 PHZA SEQ ID NO:161 

MREYQRLKGFTDNLELRRRNRATVEHYMRMKGAERLQRHSLFVE 

DGCAGNWTTE S GEP LVFRGHE SLRRLAEWLERCFPDWE WHNVR I FETEDPNHLWVECD 

GRGKALVPGYPQGYCENHYIHSFELENGRIKRNREFTNPMQKLRALGIAVPQIKRDGIPT 



Fig. 18B 



PhzB SEQIDNO:162 



MLDNAI PQGFEDAVELRRKNRETWKYMNTK6QDRLRRHELFVEDGCGGLWTTDTGSP I VI RGKDKLAEHAVWSLKCF 
PDWEWYNIKVFETDDPNHFWVECDGHGKILFPGYPEGYYENHFLHSFELDDGKIKRNREFMNVFQQLRALSIPVPQIK 
REGIPT 



Fig. 18C 



PhzC SEQ ID NO:163 

MDDLLQRVRRCEALQQPEWGDPSRLRDVQAYLRGSPALIRAGDILALRATLAGSPAARRWWCSAATAPRTWTTT 

Fig. 18D 



PA14 phzR SEQ ID NO:164 
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phzR DNA sequence : 1161 bp 

C6TCGACGAGGCCCGC CATGGGCCAAGGTTTGTTGT CGGGAGGCgCTCCCGACGACGATG 

GAGCGTGCGAGAAGAACAATGAGAAAGACCGCCGTGAGGCCCATCGGAGAGCCGTTCTAC 

GGTTTCCGCAAAGATCCGGGGCGCCGTCCCCTCCAGCa CAGCGCAGTTCCTGCGCGGCGC 

CTCGTGTCCGTGCTCATCGAGAAGTTCTCTTCAGCCTCGTTTCGTCGTCGCCCGGCGGGC 

GGCGAATGGGCTCGACCTCGTCCGGAACACCCGCACAGGGCCGGTGGCGATATGTACTTC 

CAGGTCCGGCTTGATAAAGGGAATTGTCATGAGTGGATAAGACGGAAACAAAAAAGAATA 

AAAACGCTGAAGAACCGAATCCTGCCGGGATCGATTGTTGACTGGTGAAGCTGGCATGCA 

TGATGAGAGAGAGGGATATCTCGAGATTTTGTCAAGAATAACAACCGAGGAAGAGTTCTT 

CTCCCTGGTTCTCGAGATATGCGGTAATTATGGATTCGAATTCTTTTCATTCGGTGCGCG 

GGCGCCTTTCCCGCTGACCGCGCCTAAATATCATTTCCTGTCCAATTACCCAGGGGAATG 

GAAAAGCAGATATATCTCCGAAGACTACACATCCATCGACCCGATCGTGCGCCATGGTCT 

CCTGGAATACACCCCGCTGATCTGGAATGGCGAAGACTTCCAGGAGAACCGTTTCTTCTG 

GGAGGAAGCGCTGCATCACGGCATCCGTCACGGCTGGTCGATCCCGGTCCGCGGCAAGTA 

CGGGCTGATCAGCATGCTGTCCCTGGTGCGTTCCAGCGAGAGCATCGCCGCTACGGAAAT 

CCTGGAGAAGGAATCCTTCCTGCTCTGGATCACCAGCATGCTGCAGGCTACCTTCGGCGA 

CCTGCTGGCGCCGCGCATCGTCCCGGAAAGCAATGTGCGCCTGACCGCCAGGGAAACCGA 

GATGCTC^GTGGACCGCGGTGGGCAAGACCTACGGCGAGATCGGCCTGATCCTGTCGAT 

CGACCAGCGCACGGTGAAATTCCATATCGTCAATGCGATGCGCAAGCTCAACTCCAGCAA 

CAAGGCGGAGGCCACCATGAAGGCCTACGCCATCGGCCTGCTCAACTGAATCGACGCCTC 

GTCGCCTAGCGAGGCCGCCGC 

Fig. 18E 



PA14 PhzR SEQ ID NO:165 
PhzR peptide sequence 

MHDEREGYLEILSRITTEEEFFSLVLEICGNYGFEPFSFGARAPFPLTAPKYHFLSNYPG 
EWKSRYISEDYTSIDPIVRHGLLEYTPLIWGEDFQENRFFWEEALHHGIRHGWSIPVRG 
KYGLISMLSLVRSSESIAATEILEKESFLLWITSMLQATFGDLLAPRIVPESNVRLTARE 
TEMLKWTAVGKTYGEIGLILSIDQRTVKFHIVNAMRKLNSSNKAEATMKAYAIGLLNZ 

Fig. 18F 



34H4 SEQ ID N0:118 
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1 ACCAACATCC TGGTCCTGA6 CAACAGCCAG CGCCACGGCC TGGCCGCCGC 

51 CTGGCCGATC GTGCTCGGCG CCTGCGCGGC GGTGGCGGCG CTGATCCTGC 

101 TGCTCGGGCT CGGCCTGGGC GAGCTGCTGC GGCGCCACCC GTTGCTCCAG 

151 CAGGGGCTCG CCTGGCTTGG CGTCGGCTGG CTCAGCTACC TGGCCTGGAG 

201 CCTGTTCCGC AGCGCG 



Fig. 19 

33C7 SEQ ID NO:119 



1 CCACCGAAGT AACGGGTCAG CTCGTCGCAC AACAGGCGTC GCTCCTCGGC 
51 CTGCATCAGG CTGCCCAGCG GGCCCTGGAA CCAGTCGCGC GCGCCCGGTT 
101 GAT 



Fig. 20 

25al2.3 SEQ ID NO:120 

. _ . . 1 GCGGTGCCCT . GGATGTCGTC GTTGAAGCAG CACAGCTCGT CCTTGTAGCG 
51 CTCCAGCAAC GGCATGGCAT TGGTCTGGGC GAAGTCCTCG AATTGCAGCA 
101 GGACCTTGGG CCACGGCGCT TGATCGCCTG GATGAACAGG TCGACAA 



Fig. 21 



8C12 SEQ ID NO:121 



1 TATTTGTGTA TAAGNCTCAG GcTCtGGAGG GGCCGCTGGG CAGGCNNAAC 
51 NNCCTCGCGT NCTNGGCGAC GANTTNCNNA TGCTTCGCNT GCTGCCGGCG 
101 TCTCNNCCCT CNGTACTAgT CTACGCGTGG ACAACGTGGC 



2A8 SEQ ID NO:122 



Fig. 22 



1 NATTTGTGTA 

51 CCACCGCCGA 

101 CTTCGGGCAT 

151 AGGGTTGGTC 

201 GCGGTTTCCG 

251 CCCAACGCCG 

301 CCTT 



TAAGAGTCAG 
GCTGCTGAAG 
NCCNTCTNNA 
GAAGCAGCTC 
AGGCCATCGA 
ACTTCTACCA 



GATCGAACGC 
ATGCTCGAGC 
TCGATTCCTN 
GCCGACgAGG 
CAAGACCATG 
CGCCTCGGCG 

Fig. 23 



TTCTCTTCGC 
GCAAGGGACA 
CCCACGCAAC 
TCGGCGACAA 
TGGGAGCAGA 
TCNCCCNTCC 



CGCAGGAAAG 
AGATCATGGG 
GAAgTGATCA 
GGTCCTGTTC 
AGAACTGTTC 
NGTGCTTCCA 
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41A5 SEQ ID N0.123 
1 tcgttgtaca ggccgaacag gccgagctgc caggtgtcgc cctcg 

Fig. 24A 

50E12 SEQ ID NO: 124 
1 gagcagacct gggtacccat ggcttccttg acccgctgca cgatgatgcc cagcgccgcc 
61 ttcagatcct tggcggagtt ctcttcctgg acgatcttgc gcagcgtgtt gagcatgctc 
121 ggggccttgt ctccgtgttc agtcccgcgc cagaaggcgc ggggccagtt ccttcagggc 
181 gcggcggtag acctcgcgct tgaaggtcac cacctgtccc agggggtacc agtaactcac 
241 ccagcgccag ccgtcgaact cgggcttgct ggtgatatcc atgcgcacgc gcgcctcgtc 
301 ggacatcagc cgcagcagga accatttctg cttctggccg atgcacagcg gctggctgtg 
361 ggtccgcacc aggcgctgcg gcaaacggta gcgcagccag ccgcgg 

Fig. 24B 

35A9 SEQ ID NO: 125 
1 cgcgacagta gcatataatc aatcatgagt gattaattaa ttggcgtttc tgtaacatat 
61 ccttatgatc tgcggcgcct ttcccttgtg aggacgttca gtggccagga aaaccaaaga 
121 ggaatcccag aaaacccgcg acggcatact cgatgccgcc gagcgggttt tcctggaaaa 
181 gggcgtgggc accactgcca 

Fig. 24C 

pho23 SEQ ID NO: 126 
1 tcgatcccaa tgactacaag gacgaaatcc gccagatcgc ccgcgacaag gccaacctgg 
61 agctggacct gaagggcgac atcggctgga gcctgttccc ctggctgggc ctggagc 

Fig. 24D 

6G12 SEQ ID NO: 127 
1 ggataggtgc ggcggaaaac gtacgggacg aaagagcggt tttcccgaat gacgcatcct 
61 cctgcaagcg caacttgctg gtggtcgata gcaagtaagg cgcgagacat gtcctgaact 
121 tcatgggggc tttttcttat agggcggact gtcgattctg ctagctggta atccttcttt 
181 tattgtctct gtgtgcgctt tttgtatgga tgtgtcgaat attttgaata tcgccgttca 
241 actttatcca gggccgcagt tcagtgattt attttctcga aaagtttgtt ttttccaata 
301 ttcatgcttc atagtctggc cggcc 

Fig. 24E 

25F1 SEQ ID NO: 128 

1 gcaggaaacc gttctccana tcctgggcga gaatcctcgg cacatgcacg ccggctccgg 

61 cgagcagtcc ggcgaccttg acgaacggtc ggcagtcttc ctggggcggc ggcgcgtcca 

121 tcaccaccag gctgcggtcc cctccctgcc agcggaaata acgacggaag ctggcgtcgc 

181 tactggccgg gatcagttcg gcggggggca cttcccccca accttcggca acgaacaact 

241 cgggcaaaca agagtccaac cagcaattca gctgctggaa acgggcatca tcagacattt 

301 acggggttct ccacggccct agccgttgcg caggtcatgc tttattatcc agcatctttt 

Fig. 24F 
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Sequences of PA14 50E12 encoding for YgdPPa and PtsPpa 

1/1 31/11 61/21 Jl/31 

GAA AAG GGC CAG ACS CAC GGG GTG ACT CCA TCG GTT GGC GGG TGG CGG gAG GGC CGC GAG AGC CTT TTG CGA AGG CTC CCA CCG GGC CTT GGG AAA aCC CCT AGC CTA CCG GCT TTT GCC 

121/41 151/51 181/S1 211/11 

GGC CCT GIA TCC TCC CCG CAC GAG TCG CAA AGC CGC GCG TTG CCG CTA TCA CAA GCT TTA TGG AAC AAT GCG GGC ACA TGC GAT TTC GAG GAT GTC CCA GCG TGA TCG ATT CCG ATG GTT 

X I D S D G F 

241/81 211/91 301/101 331/111 

TTC GCC CGA ATG TCG GCA TCA TTC TCG CCA ACG AGG CGG GGC AGG TGC TGT GGG CGC GGC GTA TCA ATC AGG AAG CCT GGC AGT TCC CGC AGG GAG GCA TCA ATG ATC GCG AAA CGC CGG 
R ? H V G I I L A X E A G Q V L X A R R I N 5 E A X 8 F ? J G G I X D R S T P E 

351/121 391/131 421/141 451/151 

AAG AGG CGC TGT ATC GCG AaT TGA ACG AAG AAG TCG GGC TGG AGG CCG GGG ACG TGC GCA TCC TGG CCT GCA CCC GCG GCT GGc TGC GCT ACC GTT TGC CGC AGC GCC TGG TGC GGA CCC 
E A L Y R E L X E S V G L E A G D V R I L A C T R G X L R Y R L P fi R L V R T E 

461/161 511/111 541/131 571/191 

ACA GCC AGC CGC TGT GCA TCG GCC AGA AGC AGA AAT GGI TCC TGC TGC GGC TGA TGT CCG ACG AGG CGC GCG TGC GCA TGG ATA TCA OCA GCA AGC CCG AGT TCG ACG GcT GGC GCT GGG 
S 5 P L C I G 5 I J S if F L L R L H S D E A I V R X D I T S I P E F D G W R X V 

601/201 631/211 661/221 691/231 

TGA GTT ACT GGT ACC CCC TGG GAC AGG TGG TGA CCT TCA AGC GCG AGG TCT ACC GCC GCG CCC TGA AGg AAC TGG CcC CGc GCC HC TGG CGC GGG ACT GAA CAC GGA GAC AAG GCC CCG 
S Y X Y P L G (J V V T F I R E V Y R R A L K E L A P I L L A R D * 

721/241 151/251 181/261 811/271 

AGC ATG CTC AAC ACG CTG CGC AAG ATC GTC CAG GAA GTG AAC TCC GCC AAG GAT CTG AAG GCG GCG CTG GGC ATC ATC GTG CAG CGG GTC AAG GAA GCC ATG GGT ACC CAG GTC TGC TCG 
H L N T L R I I V Q E V X S A I D L K A A L G I I V { R V I E A K G T J V C S 

841/281 871/231 901/301 931/311 

GTG TAC CTG CTC GAC ACC GAG ACC CAG CGT TTC GTC CTG ATG GCC ACC GAA GGC CTC AAC AAG CGT TCC ATC GGC AAG GTC AGC ATG GCc CCC AGC GAA GGC EG GTC GGC CTG GTC GGC 

VYIiLDTETQRFVLKATEGIiNKRSIGKVSHAPSEGIiVGIiVG 

961/321 991/331 1021/341 1051/351 

ACC CGC GAG GAG CCG CTC AAC CTG GAG AAC GCC GCC GCC CAC CCG CGC TAC CGC TAT 3TC GCC GAG ACC GGC GAG GAG CGC TAC GCG TCG TTC CTC GGC GCG CCG ATC ATC CAC CAT aGG 

T R E E ? L H L S N A A A H P R Y R Y F A E T G E E R Y A S F L G A P I I H H R 

1081/361 1111/371 1141/381 1171/331 

CGG GTG ATG GGG GTG CTG GTG GTG CAG CAG AAG GAG CGC CGC CAG TTC GAC GAA GGC GAG GAg GCC TTC CTC GTC ACC ATG AGC GCC CAG CTC GCC GGG GTC ATC GCG CAT GCC GAG GCG 

R V H G V L V V 5 M E R R { F D E G E E A F L V T H S A 0 L A G V I A 1 A E A 

1201/401 1231/411 1261/421 1291/431 

ACC GGT TCG ATC CGC GGC CTG GGC AAG CTC GGC AAG GGC ATC CAG GAA GCC AAG TTC GTC GGC GTG CCC GGC GCC CCC GGG GTC GGG GTG GGC AAG GCG GTG GTG GTG TTG CCt CCG GCC 

TGSIRGLGKLGRGIQEAKFVGVPGAPGVGVGKAVVVLPPA 

1321/441 1351/451 1331/461 1411/471 

GAC CTG GAA GTG GTG CCG GAC AAG CAG GTC GAC GAC ATC GAC GCC GAG ATC GCC CTG HC AAG CAG GCC CTG GAG GGC GTT CGC GCC GAC ATG CGC GCG CTG TCG AGC AAG CTC GCC AGC 

0 L E V V P D I 0 V D D I D A E I A L F K 5 A L E G V R A D H R A L S S I L A S 

1441/481 1471/491 1501/501 1531/511 

CAG tTG CGC AAG GAA GAA CGC GCG CTG TTC GAC GTC TAC CTG ATG ATG CTC GAC GAT GCC TCC ATC GGC AAC GAG GTC AAG CGC ATC ATC CGT ACC GGC CAG TGG GCC CAG GGC GCC CTG 

Q L I I E E R A L 1 D V Y L X K L D D A S I G N E V I I I I R T G J X A J G A L 

1561/521 1591/531 1621/541 1651/551 

CGC CAG GTG GTG ATG GAG CAC GTG CAG CGC TTC GAG CTG ATG GAC GAC GCC TAT CTC CGC GAG CGC GCC TCC GAC GTC AAG GAC ATc GGT CGC CGC CTG CTC GCC TAC CTq CAG GAA GAa 

1 5 V V H E E V Q R F E L X D D A Y L R E R A S D V I D I G R R L L A Y L Q S E 

1681/561 1711/571 1741/581 1771/591 

CGC AAG CAG AAC CTG ACC TAC CCG GAg CAG ACC ATC ATC GTC AGC GAG GAG CTG TCG CCG GCG ATG CTC GGC GAG GTG CCG GAA GGG CGC CTG GTC GGC CTG GTC TCG GTG CTC GGC TCG 

I I fiH L T Y P E 8 T I I V S E E L S P A X L G I V P E G R L V G L V S V L G S 

1801/601 1831/611 1861/621 1891/631 

GGC AAC TCG CAC GTG GCG ATC CTC GCC CGT GCC ATG GGC ATC CCC ACG GTG ATG GGG GCG GTC GAC CTG CCG TAC TCC AAG GTC GAC GGt ATC GAC CTG ATC GTC GAT GGC TAC CAC GGC 

G X S H V A I L A R A X G I P T V X G A V D L P Y S I V D G I D L I V D G Y B G 

Fig. 24H 



113/133 



1921/541 1951/651 1981/551 2011/571 

GAG GTC IAC ACC AAC CCC ICC GCC GAG CTG GIG CGC CAG TAC AGC GAC GTG GTC GCC GAG GAG CGC GAG CTG AGC AAG GGC CTG GCG GCC CTG CGC GAG CTG CCC IGC GAG ACC CTC GAC 

E V Y T N P S A E L V R Q Y S 0 V V A E E I E L S X G L A A L R E L P C E T L D 

2041/531 2071/591 2101/701 2131/711 

GGC CAC CGC ATG CCG CTC TGG GTC AAC ACC GGC CTG CTC GCC GAT GTC GCC CGC GCC CAG GAG CGT GGC GCC GAG GGC GTG GGC CTG TAC CGC ACC GAA GTG CCG TTC ATG ATC AAC GAC 

G E R I P L X V X I G L L A D 7 A R A 0 E R G A S G V G L Y R T E V P F X I N D 

2151/721 2191/731 2221/741 2251/751 

CGC TTC CCC AGC GAG AAG GAA CAG CTG GCG ATC TAC CGC GAG CAG CTC AGT GCC TTC CAC CCG CTG CCG GTG ACC ATG CGC ACC CTG GAT ATC GGC GGC GAC AAG GCG CTG TCC TAC TTC 

RFPSEKEQLAIYREQLSAFHPLPVTHRTIiDIGGDKAIiSYF 

2281/761 2311/771 2341/781 2371/731 

CCG ATC AAG GAA GAC AAC CCG TTC CTC GGc TGG CGC GGC ATC CGC GTC ACC CTC GAC CAC CCG GAG ATC TTC CTG GTC CAG ACC CGC GCC ATG CTC AAG GCC AGC GAA GGA CTG GAC AAC 

P I I E D N P F L G X R G I R V T L D E P E I F L V J T R A X L X A S E G L D S 

2401/801 2431/811 2461/821 2491/831 

CTG CGC ATC CTG CFG CCG ATG ATC TCC GGC ACC CAC GAG CTG GAA GAG GCC CTG CAC CTG ATC CAC CGC GCC TGG GGC GAG GTG CGC GAC GAG GGC GTG GAC ATC GCC ATG CCG OCt ATC 

L R I L L ? X I S G T B I L E E A L H L I E R A X G E V R D E G V D I A X P P I 

2521/841 2551/851 2581/851 2611/871 

GGC ATG ATG GTC GAG ATT CC£ GCC GCC GTG IAC CAG ACC CGC GAG CTG GCC CGt CAG GTC GAC TTC CTT ICG GTC GGT TCG AAC GAC CTG ACC CAG TAC CTG CTG GCG GTC GAC CGC AAC 

GHMVEIPAAVYQTRELARQVDFLSVGSHDLTJYLLAVDRS 

2641/881 2671/891 2701/901 2731/911 

AAT CCG CGG GIC GCC GAC CTC TAC GAC TAC CTG CAT CCG GCC GTg CTG CAT GCG TTG AAG AAG GTG GTC GAC GAT GCC CAC CTG GAA GGC AAG CCG GTG AGC ATC TGC GGC GAG ATG GCC 

X P I V A D L Y D Y L H P A V L 1 A L I I V V D D A H L E G I P V S I C G E X A 

2761/921 2791/S31 2821/941 2851/951 

GGC GAT CCC GCG GCT GCC GTG CTG CTC ATG GCG ATG GGC TIC GAC AGC CTG TCG ATG AAC GCC ACC AAC CTG CCC AAG GTG AAG TGG CTG CTG CGC CAG ATC ACC CIG GAC AAG GCC CGG 

G D P A " A A V L L X A X "G F D S L S M N A T" X L P I V K X L L R Q I T L D K A R 

2881/961 2911/971 2941/981 2971/391 

GAC CTG CTC GGC CAG TTG CTC ACC TTC GAC AAC CCG CAG GTC ATC CAC AGC TCG CTG CAC CTG GCG TTG CGC AAC CTC GGC CTG GGT CGC GTG ATC AAC CCG GCG GCT ACC GTC CAG CCC 

D L L G 0 L L T F D K P { V I E S S L H L A L R N L G L G R V I S P A A T V Q P 

3001/1001 
TGA TTT TCC C 



Fig. 241 
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Sequence of PA14 35A9 encoding mtrRPa 

1/1 31/11 61/21 91/31 

6TC GAT TIG 65A CAS CAC GGT GCC GGC GCG GAC TgC CTG GCC TTC CTC GTA CAG GCG ACG GGT GAC GAT GCC GGC GAC GCG CGC CCG CgC cTC gGC CTG GCG GTA CGC TTC CAG GCG TCC 
121/41 151/51 181/61 211/71 

GGG (AG C!C GCT GGT GAT GCC GAT gGG CGC CGG OCT GGC GAC GAT CAC GCC GAC CTC GGC GGG GGC CTC CGC AGT CTC CCC GGt GTC CGC TGC TTC TTC GCA GCC CAG CAG GAA TAG GGC 
241/81 271/91 301/101 331/111 

GAC CAG GGC CGC CAG CAG CCC GCG CAG CGA GCC GGT CCA TTG GAT GTG CAT GGG TGT CCC TCG ATT CGT GAA CTC GCG AGC TTG CCC GGG AAg GGG CAC CGC AAC TCA CGA GCG GCG CGA 
361/121 391/131 421/141 451/151 

CAG TAG CAT ATA ATC AAT CAT GAG TGA cTA ATT AAT TGG CGT TTC TGT AAC ATA TCC TTA TGA TCT GCG GCG CCT TTC CCT TGT GAG GAC GTT CAG TGG CCA GGA AAA CCA AAG AGG AAT 

HARKTKEES 

481/161 511/171 541/181 571/191 

CCC AGA AAA CCC GCG AtG GCA TAC TCG ATG CCG CCG AGC GGG TTT TCC TGG AAA AGG GCG TGG GCA CCA CTG CCA TGG CCG ACC TGG CGG ACG CCG CCG GGG TTT CTC GOG GTG CGG TCT 
Q K T S D G I L D A A I R V F L E I G V G T T A M A D L A D A A G V S I G A V Y 

601/201 631/211 661/221 691/231 

ACG GCC ACT ACA AGA ACA AGA TCG AGG TCT GtC TGG CGA TGT GCG ACC GCG CCT TCG GCC AGA TCG AGG TAC CCG AtG AAA ACG CCA GGG TGC CGG CGC TGG AcA TCC TCC TGC GCG CCG 
G H J I N I I E V C L A H C D I A P G Q I E V P D E I A R V P A L D I L L R A G 

721/241 751/251 781/261 811/271 

GCA TGG GCT TTC TCC GCC AGT GCT GCG AaC CCG GTT CGG TGC AGC GGG TGC TGG AGA TCC TCT ACC TCA AGT GCG AAC GCA GCG ACG AGA ACG AGC CGC TGT TGC GCC GCC GCG AGC TGC 
HGFLRJCCEPGSVQRVLEILYLKCERSDENEPLLRSRELL 

841/281 871/291 901/301 931/311 
TCG AGA AGC AGG GGC AAC GCT TCG GCC gaC GGC AGA TCC GCC GGG CGG TGG AgC GCG GCG AAC TGC CGG CGC GGC TGG ACG TCG AGC TGG CCA GCA TCT ATC TGC AAT CGC TgT GGG ACG 
E~r J-G -Q 1 F- G -R-R-D— I — R-- -R A V E — R G E L P A R L D V E L A S I Y L 5 S L « D G 

961/321 991/331 1021/341 1051/351 

GCA TCT GCG GCA CCC TGG CCT GGA CCG AGC GCt TGC GCG ACG ATC CCT GGA gCC GCG CCG AAC GCA TGT TCC GCG CCG GCC TCG AtA GCC TGC GCA GTT CTC CCT ACC TcT TGC TGG CGG 
I C G T L A 8 T E S L 1 D D P K S i A E R I F R A G L D S L R S S P Y L L L A D 

1081/361 1111/371 1141/311 1171/391 

ACG CCT GAG GGC GTC AAT CGT CCG CCA TCA GGT GCC TGC GCT GGT CCT CGG CGC CGG CGA CCA CCA GCC GCT GGG CGT CcT CCT CGC TGA TGT GCA GGC GCT TGC CaT CGA TGT AGA GCA 
A * 

1201/401 1231/411 1261/421 1291/431 

COG ACA GGC GCG CCT CGG CGT CGG TAC CGA TGC GCA GGC TGT CGA CCG GCG CGC GAT GCC GGC TGC CTT CGA TCT CCA CGC TGC AGA tGC CTT GTT CCG AAT CGA TTT CGA TGG ACA TGG 

1321/441 1351/451 1381/461 1411/471 

GAa CCT CCc GTT TTc TCC GCC TAC CTT GGG TGG ACC CCG GGC ATC CGC GCG GGT TCT GTC ACG GTA GCT TCA CGC CAG CGT CAC GCG CCT GOC ACC GCG CTT GGC TGC AAT COT CCG CAG 

1441/481 1471/491 1501/501 

AGA aGG CGA GGC CAG CGG AGG ACG ACG CCA TGC GGC TAT GCG TGA TTG GTG CGG GCT ATG TGG GAC TGG TGA 
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Sequences of PA14 25F1 encoding for orfl", OrfU and DjIAPa 

1/1 31/11 61/21 91/31 

CGA GGA ATC CAG TCG AGG TGC GAg TAG TCC 6CA CTG CGG GAT cTC AGC GCG CGA CCa CCG GAC TCG GTG ACC AGG CGC TGG GTC GGC TCT GCC TCG ACG GTT TCG CCT CCG CTG CCG GAC 

121/41 151/51 181/61 211/71 

ACG CTG CTG CCC GCC GCG GCG GTG CTG ACC GAG GTC GCG GTA TGC GCC GGG CGC GGI GGC AGG TTG GCA TTG GCG TTC TGC AGC GGG GAG CAA TCC CAG CCG CCG GTG GCC GAT ACC TTG 

241/81 271/91 301/101 331/111 

CAG TCG AAC TGA TCG GCG GCC TGT ACA GTC AAT GCT GCG ACC GGC TGC AGA GCC AGC AGG CTG CCG GTG ACC AGC AGG GGA AAC TTT CTT CGA AAC ACG AGG GAT TTC ACT GCC ATC TTG 

361/121 391/131 421/141 451/151 

TTA ATC CGG GCT TCC TGC GCG CCA TCG GCC CGG TGG GCC GCA CGC CTC TCG ATG GGC TGA AAA AGA TGC TGG ATA ATA AAG CAT GAC CTG CGC AAC GGC TAG GGC CGT GGA GAA CCC CGI 

481/161 511/171 541/181 571/191 

AAA TGT CTG ATG ATG CCC GTT TCC AGC AGC TGA ATc GCT GGT TGG ACT CT! GTT TGC CCG AGT TGT TCG TTG CCG AAG GTT GGG GGG AAG TGC CCC CCG CCG AAC TGA TCC CGG CCA GTA 
K S D D A R F 8 J L N R W L D S C L P E L ? V A E G H G E V P P A E L I P A S S 

601/201 631/211 661/221 691/231 

GCG ACG CCA GCT TCC GTC GTT ATT TCC GCT GGC AGG GAG GGG ACC GCA GCC TGG TGG TGA TGG ACG CGC CGC CGC CCC AGG AAG ACT GCC GAC CGT TCG TCA AGG TCG CCG GAC TGC TCG 
D A S F R R Y ? R » 0 G G D R S L V V M D A P P P fl E D C R P F V I V A G L L A 

721/241 751/251 781/261 811/271 

CCG GAG CCG GCG TGC ATG TGC CGA GGA TTC TCG CCC AGG AtC TGG AGA ACS GTT TCC TGC TGC TCA GTG ACC TGG GCC GGC AGA CCT ACC TCG ACG TGC TCC ATC CCG GaA ATG CCG ACG 
G A G V B V P R I L A Q D L E N G F L L L S D L G R 0 T Y L D V L H P G N A D E 

841/281 871/291 901/301 931/311 

AGC TGT TCG AAC CGG CCC TGG ATG CGC TGA KG CCT TCC AGA AGG TCG ATG TCG CCG GTG TCC TGC CTG CCT ACG ACG AAG CGG TGC TGC GCC GCG AGC TGC AGC TGT ICC CCG ACT GGT 
LFEPALDALIAFQKVDVAGVLPAYDEAVLRREIiQLFPDHY 

- 951-/3-21- --- 991/331 - - - - - 1021/341 1051/351 

ACC TGG CCC GCC ACC TCG GCG TGG AGC TGG AGG GCG AGA CGC TGG CCC GCT GGc AgC GGA TCT GCG ACC TGC TGG TAC GCA GCG CGC TGG AGC AAC CGC GGG TGT TCG TCC ATC GCG ACT 
L A R H L G V B L E G I T L A R H Q R I C D L L V R S A L E Q P R V F V H R D Y 

1081/361 1111/371 1141/381 1171/391 

ATA TGC CGC GCA AcC TGA TGC TCA GCG AGC CCA ACC CGG GCG ICC TCG ACT TCC AGG ACG CCC W ACG GCC CGG TCA CCT ACG ATG TCA CCT GCC TGT ACA AGG AtG CCT TCG TCA GTT 
HPRNLHLSEPNPGVLDFQDAIiHGPVTYDVTCLIKDAFVSfl 

1201/401 1231/411 1261/421 1291/431 

GGC CGG AGC CGC GCG TGC ATG CCG CGC TGA gtC GTT ACT GGA AGA AGG CGA CCT GGG CCG GCA TCC CGC TGC CGC CAA GCT TCG AgG ACT TCC TCC GcG CCA GCG ACC TGA TGG GCG TGC 
P E P R V H A A L S R Y H I I A T W A G I P L P P S F E D F L R A S D L M G V Q 

1321/441 1351/451 1381/461 1411/471 

AGC GCC ACC TGA AGG TGA TTG GCA TCI TCG CCC GTA TtT GTC ACC GCG ACG GCA AGC CGC GCT ACC TGG GTG ACG TGC CaC GCT TCT TCC GTT ATC TGG AAA CCG CCG TGG CGC GCC GTC 
R H L K V I G I F A R I C H R D G I P R Y L G D V P R F F R Y L E I A V A R R P 

1441/481 1471/491 1501/501 1531/511 

COG AGC TGG CCG AAC TGG GCG AGC TGC TGG CCT CGC TGC CGC AGG GAG CCG AGG CAT GAA GGC GAT GAT CCT CGC CGC CGG cCG TGG CGA GCG CAT GCG GCC GAC CAC CCT GCA CAC GCC 
H K A H I L A A G R G 2 R K R P T T L R T ? E L A E L G E L L A S L P Q G A E A 

1561/521 1591/531 1621/541 1651/551 

CAA GCC GCT GAT CGA GGC CGC CGG CGI GCC ATT GAT CGA GCG TCA GTT GCT GGC GCT GCG CCA GGC CGG AGT CGA CGA CTG GGT GAT CAA CCA TGC CTG GCT TGG CGA GCA GAT CGA GGC 
I P L I E A A G V P L I E R Q L L A L R Q A G V D D V V I N E A K L G E { I E A 
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1681/561 1711/571 1741/561 1771/551 

CTA TCT C66 CGA CGO cTC 6C6 CCT GGG C66 GCG GAT CGC CTA TTC aCC cGA GGG aQA ACC GCT GGA AAC CGG cGG tGG AAT CTT CCG CGC CCT GCC GTT GCT CGG CGA GCA GCC GIT CCT 
YLGDGSRLGGRIAYSPEGEPLETGGGIFRALPLLGEQPFL 

1801/601 1831/611 1861/621 1891/631 

GTT GCT CAA CGG CGA TGT CTG GAG CGA CTT CGA CTA CTC TCG GCT GCA TCT TGC OGA CGG CGA CCT GGC GCA TCT GGT GCT GGT CGA CAA CCC GGC GCA CCA TCC CGC CGG CGA TTT CCA 
L L H G D V V S D F D I S R L B L A D G D L A H L V L V D X P A H B P A G D F B 



1321/641 1951/651 1981/661 2011/671 

CCT GGA TGC CGG CGG ACG GGT GGG CGA GAC CCG CGA AGC GGG CGG CAA CCT gAC CTA CAG CGG GAT CGC CGT ACT GCA TCC CGC GCT GTT CGA GGG CTG CCA GCC GGG CGC CTT CAA GCT 
LDAGGRVGETREAGGNLTYSGIAVLHPALFEGCQPGAFKL 

2041/681 2071/691 2101/701 2131/711 

GGC GCC GCT M GCG CAA GGC m CGC CGC GGG GCG GGT CAG CGG CGA ACA CtA TCG TGG GCA GIG GGT CGA CGT CGG TAC CCA CGA GCG CCT GGC GGA AGT CGA GCG ATT GCT GGC GGA 
A P L L R K A I A A G R V S G E H Y R G Q H V D V G T E E R L A E V E R L L A E 

2161/721 2191/731 2221/741 2251/751 

GCA CGC CTG AGA TGC TCT GGC CCG CTA CGC TGA TCG GAG CCG GAG CCG GCT GGG CCC TGG CCA GCA TCC CCG GCG CCC TGC TCG GCG GCC TGC TGG GGC AAC TGC TGG ACC GCA GGT TGC 
HA* K L K P A T L I G A G A G N A L A S I P G A L L G G L L G Q L L D R R L I 

2231/761 2311/771 2341/781 2371/791 

GCC TGG AGT CCT GGC GCG GCC TGC TGG CGC GCT TGC GCG GGC GGG CGG TGA ACG ATG AGG ACG ACC TGC TGT TcC AGT TGC TCG GCT ATC TGG CCA AGA GCG GCG GGC GGG TGG AGG AGA 
LESHRGLLARLRGRAVNDEDDLLFQLLGYLAKSGGRVEEM 

2401/801 2431/811 2461/821 2491/831 

TGC ATA TCC GCC AGG CGC GCG AGG AGA TGG CGT TGC GCA AGC TCG ATA GGC GAG CCC AGC GGC GTG CCA TCG CGT CCT TCG GCA AGG GCA AGG CCG GCA TCG CCC ATC TGC AGG CGG AGG 
B I R 5 A R E I K A L R it L D R R A { R R A I A S F G I G I A G I A H L Q A E V 

2521/841 2551/851 2581/861 2611/871 

TCS CGC GTC TGA-AGG GCGAAC GTG CGG AGG CAG -TAT- TGC TCG CCT-GCT GGC GGA -TGG CCT GGG CTG GCG GCG TGC TCA GCC AGT CGG -CGC -GAC AAC TGG TGT TGC AAT GGG GGC GCT GGC 
ARLKGERAEAVLLACWRHAffAGGVLSQSARQLVLQflGRffL 

2641/881 2671/891 2701/901 2731/911 

TGG GTI GOT CGG CGG AGC GAA CGG AAC GCT TGT CGG CGC GGG TCA TGC CGA AGC GGA CGC GCG CTG TCG CCC GGG ATA GCT ACC GTG AGG CCC TCC TGC TGC TCG GCG TGG AGG CCG GAA 
G H S A E R T E R L S A R V M P I I I R A V A R D S Y R I A L L L 1 G V E A G S 

2761/921 2791/931 2821/941 2851/951 

GCG AGC CGG CGC TGA TCA AAC GCG CCT ATC GCA AGC TGA TCA GCC AGC ATC ATC CGG ACA AAC TGG CGG GAG CCG GCG CCA GCG TCG AGC GCG TGC GTG CGG CTA CCG AGA AAA CCC GTG 
E P A L I K R A Y R I L I S Q H E P D K L A G A G A S V E I V R A A T E K T R E 

2881/961 2911/971 2941/981 2971/991 

AAT TGC AGG CGG CCT ACG CCC TGG TCC GAG AGC GTG AGG GGT TCC GCT GAT CAC TCC GCA GGT HC TGC GCA TCG GCC TGC AGG TGA AGA CTG AGC CAG CCG CGG ATT CGT CGG TAC AGT 
LQAAYALVREREGFR* 

3001/1001 3031/1011 3061/1021 3091/1031 

TGC TCC TGC TCC GCC TTG GGG TCG GCC GGT AGa GCC TGC ATC GCG ATT TGT ACG TAG GCc GGG TGT TTC TGC CGC TTG CCG GCC TGC ATG CGC AGC CTC GCC GCC TCG CGG TCG GCG CG 
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PhnA and PhnB SEQ ID NO: 129 

1 CTGCAGC6TC TGCCGACCCT 
51 GCTGCGGGGG CGGCTGGCCG 
101 GTCTGCGCCT GTGCCGACGG 
151 CTCGACGAAC TGAGCGAGGA 
201 CGACTTCCTG CCCGGCGAAC 
251 AGATTCTCTC CCGCCAGGCG 
301 GCGGCGATCG CGCCCGACGG 
351 TCGATGATGA ATATGCCGTT 
401 ATGGGCGCGG GGAGGTGGCT 
451 AAGTCTGGAG TACCGCACGC 
501 TGGCTGGCGC CAACCGGATG 
551 GCTGCGCGGC GTAGCGTGGC 
601 CTGGGGGCGC GATGTGGTGC 
651 TGCTGGCGCC GTTGAGCGAG 
701 GACGGCGACA CCCTGCACTG 
751 CGAGTGGCGA CGCCTGCATG 
801 TGCTCGGAAC GCTGGGCGAC 
851 TTCAGTTTCG ACCTGGCCGA 
901 ACCTGCGCGG CATTGCCCGG 
951 TGGATATCGA TCAGCTGGCG 
1001 GATCCGGCCG GGCACGACCG 
1051 CGAATTCCAT GGCGCCGTGG 
1101 TACGGGCCGG CAACTACCAG 
1151 CAGGTAGAAC GCCTGCAGGC 
1201 CGTACCTTCG CGCAGCTTCA 
1251 ATCGCCAGTT GTGCCTGCGC 
1301 GCGGGGGACT TCTGCCTGTT 
1351 CGACGCGGAG AGTCGCGAGG 
1401 CGCGCGGATG CGATGCCCGG 
1451 CTGGAAGCGG AGTTGCGCCT 
1501 GCTGGTCGAC CTGGCGCGCA 
1551 CCCGGCAGGT GCGCGACATG 
1601 CACCTGGTCT CGCGCGTGGC 
1651 GCATGCCTAC CGTGCCTGCC 
1701 AGGTCCGTGC CATGCAGTTG 
1751 AGCTACGGTG GTGCGATCGG 
1801 CAGCATTGTC ATCCGCTCCG 
1851 GGGCAGGCGC CGGCGTGGTG 
1901 GAAACCCGCA ACAAGGCGCT 
1951 ACGCGAAAGG GGAGAGCGCG 
2001 TCCTTCACCT ACAACCTGGT 
2051 CCGGGTGATG CGCAACGATA 
2101 TGGCCGACGG TTGCGAACTG 



GCTGCAACTG ATCCCGGGAC ACGGCGGCCT 
CGGATGGGGC CGAGTCGGCC TATACCGAGT 
TTGCTCTGGC GCCAGTCCAT GGGCGAGTCC 
GCTGCACCGC GCCTGGGGAG GGCAGAGCGT 
TGCACCTGGG GAGCATGCGC CGGATGCTGG 
CTGCCTCTGG ACTGAGGCGG AACATCCATT 
CTGCGGTCGC AATTGGGGGA AATGGGGGTA 
GCGCGCTAGC GTCGCGCAGG CCAGTCGCCC 
CGTGAGTGGG GTTGGCTATC GACTGGAAGA 
TGGTGCCGGA GGCGCTGTCG ATCTGGCGCA 
CTGTTCGACT GCTTCGACGT GGACAGCAAG 
GATCCTTTCC AGCTGCCTGC GCATCGAGTG 
TGCGGGCGTT GAACTCCAAC GGACGCGCCT 
GCCTGTCCGG CCCAGGTCAC CTGCTTGCGT 
GCGCTTCCCC CCGGAAGAGC CGCATGCGGA 
GCCTGTCCAG CCTGGAGGCG CTGCGCCGCG 
GCGGAGGGGC CTGCGCTGCT GGGCGGCCTG 
GCAGTTCGAA CCCTTGCCGG CGCCGGCCGA 
ACTACCTGTT CCTGGTGCCG GAGTTGCTGC 
CGCCGGACTT CGCTGCAAGG GTTCGTCCAC 
GTTGGCCGCC AGCCTGCGCC AATGTGCCGA 
AGGAGGCTTC CGAGTCGCCG GTGGCAGGCG 
GTCGACCTGG ACGATGCGAG CTTTGCCCGC 
CCACGTGAGG GCCGGCGACG TGTTCCAGAT 
GCATGCCGTG CGCGGACCCC TGGCGGGCCT 
AACCCCAGCC CGTACCGCTT CTTCCTCGAT 
CGGCGCTTCG CCGGAGTCGG CATTGAAGTA 
TGGAACTCTA TCCCATTGCC GGCACCCGCC 
GGCGCCATCG ATGCGGAACT GGACAATCGC 
GGATGCCAAG GAGATCGCCG AGCACATGAT 
ACGATCTGGC GCGCGTCTGC CGCAGCGGTA 
CTCAAGGTCG ATCGCTACAG CCACGTGATG 
TGGCGAACTG CACGGCGAAC TGGATGCGCT 
TGAACATGGG CACCCTGGTC GGCGCGCCGA 
CTGCGGCAGT ACGAGGATGG CTATCGCGGC 
CATTCTCGAC AGCGCCGGCA ACCTCGATAC 
CCGAGGTCCG CGAAGGTATC GCGCGGGTTC 
CTGGATTCGG ATCCACGGCT GGAGGCCGAG 
GGCGGTGCTG ACCGCCGTGG CCGCTGCCGA 
ATGCGCATCA CGCTGTTGGA TAACTTCGAT 
CGAGCAGTTC TGCCTGCTCG GCGCGGAGGT 
CGCCGTTGCC GACGATCCAG GCGGCATTGC 
CTGGTGCTGT CGCCGGGGCC CGGTCGGCCG 
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2151 GAAGACGCCG GCTGTATGCT GGAATTGCTC GCCTGGGCCC GCGGGCGCTT 
2201 GCCGGTGCTC GGCGTCTGCC TCGGCCACCA GGCGCTGGCG CTGGCCGCCG 
2251 GTGGCGCGGT GGGCGAGGCG AGGAAGCCGC TGCATGGCAA GAGCACGTCC 
2301 CTGCGTTTCG ATCAGCGTCA CCCGCTGTTC GACGGCATCG CTGACCTGCG 
2351 CGTCGCGCGC TACCACTCGC TGGTGGTCAG TCGCCTGCCG GAAGGTTTCG 
2401 ACTGCCTGGC CGATGCCGAT GGCGAGATCA TGGCGATGGC CGATCCGCGC 
2451 AATCGACAGC TGGGCTTGCA ATTCCATCCC GAGTCGATTC TCACCACCCA 
2501 CGGCCAGCGT CTGCTGGAGA ACGCTCTACT CTGGTGCGGC GCGTTGGCGG 
2551 TCGCGGAGCG CCTTCGGGCC TGAGCGGCGC TGCGCAGTTT CGACCGAGGC 
2601 TCGGTTGCCA GGCCGGCGCA TCGTCGAAAC GCTGGCGGCC CAGTTCGCGC 
2651 AGGCGCTGGC GGGCGCTTTC GAGAAAGCGA CGGAAGCTGC GCTCGGATTC 
2701 CAGCGCGGTG TTGTAGTAGC AATACACCTT GGTGTCGATG CCGCCCGGTT 
2751 CGTACAGTTC GCTGAGGACT GCCAGGGTAC CGTTGCGCAG GCGTTCCTcG 
2801 ACGAAATAAT GCGGCGaGAT GCCCCATCCG ACGCCGGCTT CCACCAGACG 
2851 CAGCATGTCG TCGAAGTTTT CCACGAAGAG CACCTTGTCG CTGACCGGCC 
2901 GCAGCAGGTT CGAATGCTGC CCGGAGCGGC TgCCGAGGCT GATCTGCCGG 
2951 TAATTGGCCA GGCTCGCGAT GCTGTGCAGG GAGGCATTGC ACAACGGGTG 
3001 CTGCGGATGG GCGACGACGA ACGCCTTGGT GTAGCCGAGC ACGCACTGGT 
3051 TGAAGCGGGA GATCT 
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PhnA protein SEQ ID NO:130 

1 MGARRWLVSG VGYRLEESLE YRTLVPEALS 

51 AARRSVAILS SCLRIECWGR DWLRALNSN 

101 DGDTLHWRFP QEESHADEWR RLHGLSSLEA 

151 PSFDLAEQFE PLPAPAEPAR HCPDYLFLVP 

201 DPAGHDRLAA SLRQCADEFH GAVEEASESP 

251 QVERLQAHVR AGDVFQIVPS RSFSMPCADP 

301 AGDFCLFGAS PESALKYDAE SREVELYPIA 

351 LEAELRLDAK EIAEHMMLVD LARNDLARVC 

401 HLVSRVAGEL HGELDALHAY RACLNMGTLV 

451 SYGGAIGILD SAGNLDTSIV IRSAEVREGI 

501 ETRNKALAVL TAVAAAERER GERDAHHAVG 

Fig. 26 

PA14 degPSEQ ID NO:131 

1 CGTCCGATTC GGCCTGAGTC TTTCTCTTCC CTCGAACATC ACGGGAGCTG TAGTCGATGC 
61 ATACCCTAAA ACGCTGTATG GCTGCGATGG TGGCCTTGCT GGCCTTGAGC CTGGCGATGA 
121 CGGCCCGGGC "AGAACTGCCG GACTTCACGC CTTTGGTCGA ACAGGCGTCG CCGGCGGTGG 
181 TGAATATCAG TACGCGGCAG AAGCTGCCGG ATCGCGCCAT GGCGCGCGGG CAGCTGTCGA 
241 TCCCCGACCT CGAAGGGCTG CCGCCGATGT TCCGCGACTT CCTCGAGCGC ACGATCCCGC 
301 AGGTTCCGCG CAATCCGCGC GGCCAGCAGC GCGAGGCGCA ATCGCTGGGC TCCGGCTTCA 
361 TCATCTCCAA CGACGGCTAC ATCCTCACCA ACAATCACGT CGTGGCCGAT GCCGACGAGA 
421 TCCTGGTGCG CCTGTCCGAC CGTAGCGAGC ACAAGGCCAA GCTGGTCGGC GCGGACCCGC 
481 GCAGCGACGT GGCGGTGCTG AAGATCGAGG CGAAGAACCT GCCGACCCTG AAACTGGGCG 
541 ATTCGAACAA GCTGAAAGTG GGCGAATGGG TCCTGGCCAT CGGTTCGCCG TTCGGCTTCG 
601 ATCACTCGGT CACCGCCGGT ATCGTCAGTG CCAAGGGGCG TAGCCTGCCG AACGAGAGCT 
661 ACGTACCCTT CATCCAGACC GACGTGGCGA TCAACCCGGG CAACTCCGGC GGTCCGCTGC 
721 TGAACCTGGA GGGCGAAGTG GTCGGCATCA ACTCGCAGAT CTTCACCCGT TCCGGCGGCT 
781 TCATGGGCCT GTCCTTCGCC ATCCCGATCG ATGTCGCGCT GAACGTCGCC GACCAGTTGA 
841 AGAAAGCCGG CAAGGTCAGC CGCGGCTGGC TGGGTGTGGT GATCCAGGAA GTGAACAAGG 
901 ATCTCGCCGA GTCCTTCGGC CTCGACAAGC CGTCCGGCGC GCTGGTGGCG CAGCTGGTGG 
961 AAGACGGTCC GGCGGCCAAG GGCGGCCTGC AGGTGGGCGA TGTGATCCTC AGCCTGAACG 
1021 GCCAGTCGAT CAACGAGTCC GCCGACCTGC CGCACCTGGT GGGCAACATG AAGCCGGGCG 
1081 ACAAGATCAA CCTGGACGTG ATTCGCAACG GCCAGCGCAA GTCCTTGAGC ATGGCGGTAG 
1141 GCAACCTTCC GGACGACGAC GAGGAAATCG CCTCGATGGG CGCTCCGGGC GCCGAGCGCA 
1201 GCAGCAACCG CCTGGGCGTG ACCGTCGCCG ACCTGACCGC CGAGCAGCGC AAGAGCCTGG 
1261 ATATCCAGGG CGGCGTGGTG ATCAAGGAAG TCCAGGACGG TCCGGCCGCG GTCATCGGCC 
1321 TGCGTCCGGG CGATGTCATC ACCCACCTGG ACAACAAGGC GGTGACCTCG ACCAAGATCT 
1381 TCGCCGACGT GGCCAAGGCC CTGCCGAAGA ACCGTTCGGT TTCGATGCGG GTACTG 



IWRMAGANRM LFDCFDVDSK 
GRALLAPLSE DCPAQVTCLR 
LRRVLGTLGD AEGPVLLGGL 
ELLLDIDHLA RRTSLQAFVH 
VAGVRAGNYQ VDLDDASFAR 
WRAYRQLCLR NPSPYRFFLD 
GTRPRGRDAR GAIDAELDNR 
RSGTRQVRDM LKVDRYSHVM 
GAPKVRAMQL LRQYEDGYRG 
ARVRAGAGW LDSDPRLEAE 
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PA14 degP protein SEQ ID NO: 132 



TPLVEQASPA WNISTRQKL 
PQVPRNPRGQ QREAQSLGSG 
EHKAKLVGAD PRSDVAVLKI 
FDHSVTAGIV SAKGRSLPNE 
INSQIFTRSG GFMGLSFAIP 
KDLAESFGLD KPSGALVAQL 
LPHLVGNMKP GDKINLDVIR 
RSSNRLGVTV ADLTAEQRKS 
LDNKAVTSTK IFADVAKALP 



Fig. 28 

PA 8830 algD SEQ ID NO:133 



1 MHTLKRCMAA MVALLALSLA MTARAELPDF 
51 PDRAMARGQL SIPDLEGLPP MFRDFLERTI 
101 FIISNDGYIL TNNHWADAD EILVRLSDRS 
151 EAKNLPTLKL GDSNKLKVGE WVLAIGSPFG 
201 SYVPFIQTDV AINPGNSGGP LLNLEGEWG 
251 IDVALNVADQ LKKAGKVSRG WLGWIQEVN 
301 VEDGPAAKGG LQVGDVILSL NGQSINESAD 
351 NGQRKSLSMA VGNLPDDDEE IASMGAPGAE 
401 LDIQGGWIK EVQDGPAAVI GLRPGDVITH 
451 KNRSVSMRVL 



1 GCGCGACAAA CAATCGAGGT GAATGCGATG CGAATCAGCA TCTTTGGTTT 
51 GGGCTATGTC GGTGCAGTAT GTGCTGGCTG CCTGTCGGCA CGCGGTCATG 
101 AAGTCATTGG TGTGGATGTC TCCAGCACCA AGATCGACCT GATCAACCAG 
151 GGCAAGTCGC CCATCGTCGA ACCGGGCCTG GAAGCGTTGT TGCAGCAAGG 
201"CCGGCAGACC~GGACGGCTGT" CGGGCACCAC CGAGTTCAAG- AAGGCTGTGC 
251 TGGACTCCGA CGTATCGTTC ATCTGCGTCG GCACGCCGAG CAAGAAGAAC 
301 GGCGACCTGG ACCTGGGCTA CATCGAGACC GTCTGCCGCG AGATCGGCTT 
351 CGCCATCCGC GAGAAGTCCG AACGCCACAC CGTGGTGGTG CGCAGCACCG 
401 TACTGCCGGG CACCGTCAAC AACGTGGTGA TCCCGCTGAT CGAGGACTGC 
451 TCGGGCAAGA AGGCCGGGGT CGACTTCGGC GTCGGCACCA ACCCCGAATT 
501 CCTCCGCGAG AGCACCGCGA TCAAGGACTA CGACTTCCCG CCGATGACCG 
551 TGATCGGCGA ACTGGACAAG CAGACCGGCG ACCTTCTCGA GGAAATCTAC 
601 CGCGAGCTGG ACGCGCCGAT CATCCGCAAG ACCGTCGAGG TCGCCGAGAT 
651 GATCAAGTAC ACCTGCAACG TCTGGCACGC CGCCAAGGTC ACCTTCGCCA 
701 ACGAGATCGG CAACATCGCC AAGGCGGTCG GCGTCGACGG CCGCGAGGTG 
751 ATGGACGTGA TCTGCCAGGA CCACAAGCTC AACCTGTCGC GCTACTACAT 
801 GCGTCCCGGC TTCGCCTTCG GCGGCTCCTG CCTGCCCAAG GATGTACGCG 
851 CCCTCACCTA TCGCGCCAGC CAGCTGGACG TCGAGCACCC GATGCTCGGT 
901 TCGTTGATGC GCAGCAACTC CAACCAGGTG CAGAAGGCCT TCGATCTCAT 
951 CACCAGCCAC GACACCCGCA AGGTCGGCCT GCTCGGCCTG TCGTTCAAGG 
1001 CCGGCACCGA CGATTTGCGC GAAAGCCCGC TGGTGGAGCT GGCCGAGATG 
1051 CTCATCGGCA AGGGCTACGA GTTCCGCATC TTCGACCGCA ACGTCGAATA 
1101 CGCGCGTGTC CACGGGGCCA ACAAGGAATA CATCGAGTCG AAGATCCCGC 
1151 ACGTCTCCTC GCTGCTGGTC TCCGACCTCG ACGAAGTGGT GGCGAGTTCC 
1201 GATGTGCTGG TGCTGGGCAA TGGCGACGAG CTGTTCGTCG ACCTGGTGAA 
1251 CAAGACCCCG AGCGGCAAGA AGCTGGTCGA CCTGGTGGGC TTCATGCCGC 
1301 ACACCACCAC TGCCCAGGCC GAGGGCATCT GCTGGTAGCG G 
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PA 8830 algD protein SEQ ID NO: 134 

1 MRISIFGLGY VGAVCAGCLS ARGHEVIGVD VSSTKIDLIN QGKSPIVEPG 
51 LEALLQQGRQ TGRLSGTTDF KKAVLDSDVS FICVGTPSKK NGDLDLGYIE 
101 TVCREIGFAI REKSERHTW VRSTVLPGTV NNWIPLIED CSGKKAGVDF 
151 GVGTNPEFLR ESTAIKDYDF PPMTVIGELD KQTGDLLEEI YRELDAPIIR 
201 KTVEVAEMIK YTCNVWHAAK VTFANEIGNI AKAVGVDGRE VMDVICQDHK 
251 LNLSRYYMRP GFAFGGSCLP KDVRALTYRA SQLDVEHPML GSLMRSNSNQ 
301 VQKAFDLITS HDTRKVGLLG LSFKAGTDDL RESPLVELAE MLIGKGYEFR 
351 IFDRNVEYAR VHGANKEYIE SKIPHVSSLL VSDLDEWAS SDVLVLGNGD 
401 ELFVDLVNKT PSGKKLVDLV GFMPHTTTAQ AEGICW 
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>Contigll26 of Mutant 25A12 SEQ ID NO: 135 

. . AACACC6GACGCGCCCCGATCATGT6CGCTGAGCGCTACGCTACCGTCAA 

CGAAAAAGGCCACCTCGGGGTGGCCTTTTCGCGTTCTCGCACCGATCGCG 

CGGAATATCGGCGGTTAACGCCTCTCCCCCGTGCGCACCTGCGGCTGAGC 

CTCAGAACGAAGTCCGGCGGTAGGCACGGTAGCGCGGGAACCAGAAGTTC 

GCCTCGATGGCGTCGTTCAGTACCTCGTCGCTGGTATGCAGGGCCTTGCC 

CTCGGCCTGGGCCTGCTTGGCCACGGCGACGGCGATGCGCTTGCTGACCT 

CGCGGATGTCGCCCAGCGCCGGCAACACGGCGCCCTCGCCCTGGGTAACG 

ATCGGCGAGCAGTTGGCCAGGGCGTTGGCCGCGGCCATCAGCATGCCTTC 

GGTGACCCGATTGGCCCGCGCGGCGATCACCCCCAGGCCGATGCCGGGGA 

AGATATAGGCGTTGTTGCACTGGGCGATGGGAATCCGCTTGTCGCCCACC 

TGCACCGGTTGGAACGGGCTACCGGTGGCGACCAGCGCCTGGCCGTCGGT 

CCAGTTGAGGATTTCCTGCGGAGTCGCCTCGACCCGCGAGGTCGGGTTGG 

ACAGCGGCATCACCAGCGGCTGCTTGCAATGGCTGTGCAGCTCACGGATG 

ACCTCTTCGGAAAACAGCCCGCGCTGCCCGGAGACGCCGATCAGCACCGT 

CGGCCGGGCATTGCGGATCACTTCCAGCAACGCCAGGTCGTCGCCCTGCT 

GGCCGCCCCAGGCACCGAGATCGGCGCGCTTCTGCGCCAGGCGGTGCTGG 

AAGTCGACCAGGTTGCTCATGTCGTCGGTGAGCAGGCCCCAGCGGTCGAC 

CATGAAGATGCGCCGACGCGCCTGGGCCTCGTCCAGGCCCTCCAGTTGCA 

TGGCGGCGATGATCTGTTCGGCGATGCCGCAACCGGCGGAGGGGCGCCGA 

CGAAGGTCACGGTCTGCTCGCTGAGCTTCTCGCCCTTGGCCTTGCAAGCC 

GCCAGCAGGGTGCCCACGGCCACCGCGGCGGTGCCCTGGATGTCGTCGTT 

GAAGCAGCACAGCTCGTCCTTGTAGCGCTCCAGCAACGGCATGGCATTGG 

TCTGGGCGAAGTCCTCGAATTGCAGCAGGACGTTGGGCCAGCGGCGCTTG 

ATCGCCTGGATGAACAGGTCGACGAACTCCTCGTACTGCGCCCCGCTCAC 

CCGCTCGTGGCGCCACCCAATGTACATCGGGTCGTTGAGCAGGTCCGGGT 

TGTTGGTGCCGACGTCCAGCACCACCGGCAGGGTGTAGGCCGGGCTGATA 

CCGCCGCAGGTGTAACAGGGGACAGCTTGCCGATCGGGATGCCCATCCGG 

CCGATGCCCTGGTTGCCGAGGGCGAGGATCGGCTGGCTGTCGGTACAAAA 

CAATCTAAGGTGTCTTTGGTGGCTTGAAGGAGTTTCAATCGTTCGGGCCG 

GGAAGAATAAAGGCCCGGTGGGTCGAAACTTTGAATCTGGAAGGTTGCAA 

ACTGGGGGAAAAAATGGAAATTTTTAAGAGCCTAAGAGCGGAAAAAAGTT 

CTTTTTCTAAAAAGAAAAAAATGGGGAAAAAGTTGAAAAGTATATGATAA 

GAGCAGGTGTCAAAATGAATGTTTTGAAAGCCCAGTGAAATAAACTCTGG 

AAAAGGCAGTTATAAGGGCTATAAAAGGGATGAAAAAAGAAGTGTGTGAA 

ATAACGAAAGGCAATAGGGAAAA 
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Sequence: 33C7 coniig From: 1 To: 2048 

10 20 30 40 50 60 

I I I I I II M 1 I I M I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I 1 I I I I I I 1 I 1 

AGCTTATGCA TGCGGCCGCA TCTAGAGGGC CCGGATCCGG TGACCATCGG TCACCGGCAT 60 
GCCGGTGGTT TCGGTATCCA GTACGACGCT ACGCATCTAT AGAGCCTTTC TCTGTTTCGC 120 
TGCAGCCGTG GCTGCTGAAC GCTTGTTTCG GTGTGGCCGC TCAGCGCGGC AATTCGGCGA 180 
CGCCACGGTT GGCCAACTGG TCGGCCCGCT CGTTGCCGGG GTCGCCGGTA TGCCCGCGGA 240 
CCCACTGCCA CTCCACCTGG TGCCGGGCGA CCTGTTCATC CAGGGCCTGC CAGAGGTCGG 300 

310 320 330 340 350 360 
i ii i i i i i i I i i i i I i i i i I i i i i I i i ii I i i i i I 1 1 i i i i i i i I i i i i I — i i i i I i i i i I 

CATTCTTGAC AGGCTGCTTG CTGGCGGTCT TCCAGCCGCG CTTCTTCCAG TTCGGCAACC 360 
ATTCGGTGAT GCCGCGCATC ACGTATTCCG AGTCGGTGAT CAGACGGATC GGACAGGAAC 420 
GCTTGAGTGC CGCCAGCGCC TGGATCGCCG CCATCAGCTC CATGCGGTTG TTGGTGGTGT 480 
CCGGCTCGCC GCCCCAAAGC TCTCGCTCGG CGCCCTTGTA GAGGAGCAAC GCCCCCCAGC 540 
CGCCGCGCCC AGGGTTGCCC TTGCAGGCGC CGTCGGTATA GATCACTACC TGTTCTTTAT 600 

610 620 630 640 650 660 
i i ii I i i i i I i i i i I ii i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I — i i i I 1 i i i i I 

CTGTCATGCC TAAATTTCGG AATCTCGCCG GCTGACTTTC GCCACCGGCA TGGGCACCAG 660 
CTGACCGCGC GGTTCGCGCT TGCTCTGGCG CAACGGGCGC AACCCCACGA CCAGCTTGCG 720 
TGCCACCAAT AGATAGAAGC CGGCGCCCGA AGACTGCCAG GCGTCGCCCC AGCGCTCCAG 780 
GCGAGCCAGG CGCGATTGCC AGGCTGCCGA CGCAAGCGGC GGACGATAGC ACCCGAAGCG 840 
CCGTTTCTCC AGCGCGAAGC CCAGCAGGTT GAGCCAATCG CAGGCCCGCG ACGGAGGAAT 900 
910 920 930 940 950 960 

n i i I i i i i 1 - i i i i I i i i i 1 i i i i I i i i i I i i i i 1 i i i i I i i i i I i i i i I i i i i 1 i I I I 1 

GCAGCGGGCC TGGCGCAAGG CATCCCCGGC GAAATAATGA CGGATGCCCC ACAGGCTCCA 960 
TGGGTTGATG CCGATCAGCA GCAGGTGGCC GCCCGGACGA ACGGTACGCG CGGCTTCGCG 1020 
CAGGAGACGG TGAGGCGACA GGCAGAAATC CAGGCCGTGT TGCAGCAGGA CCACGTCCGC 1080 
GGCATGTTCG CTGAGCGGCC AGGCGCCCTC TTCGCAGGCG ATGTCCACGC CCGGCAGCGG 1140 
CGGCCCCAGG CGCACGCCGC GCTGAATCTG CCCGGTGCTC GGCGGCAGTT CGGCATGCGG 1200 
1210 1220 1230 1240 1250 1260 

I I I I I I I I l I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 1 I I 1 I I I I I 1 

CCCGTAGTGC ACCAGGTAGC CACCGAAGTA ACGGGTCAGC TCGTCGCACA ACAGGCGTCG 1260 
CTCCTCGGCC AGCATCAGGC TGCCCAGCGG GCCCTGGAAC CAGTCGCGCG CCCGGTTGAT 1320 
CGATGCCAGC CACTCGGCAT CGGTCTGGGC GAAGGCTTGC GGTTCGTTCA TGCGTACCTC 1380 
CAGCGTCTTC CCCTTCGCGG CGACGGACGC CGGCACGACG GGAAAATAAG CAATACTATG 1440 
CGCCAATGAC TTCTGCTTAG CGACATCGAC CCATGATACA GATCGACGCC CTGCCCGCCT 1500 
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Sequence: 33C7 contig From: 1 To: 2048 (continued) 

, 1510 1520 1530 1540 1550 1560 

i i i i I i i i i I i i i i I i i 1 1 I i i i i 1 i i i i I i i i i I i i i i I i i M I i i i i I i i i i l i i i i I 

TCAACGACAA CTACATCTGG CTGTTGCAAG ATGCGACAAG CCGTCGCTGC GCGGTGGTCG 1560 
ACCCCGGCGA TGCCAAGCCG GTGGAAGCCT GGCTGGCCGC CCATCCCGAC TGGCGGTTGA 1620 
GCGATATCCT GGTGACCCAC CACCATCACG ACCACGTCGG CGGCGTCGCG GCCCTGAAGG 1680 
AACTGACCGG CGCGCGGGTT CTCGGCCCGG CCAACGAGAA GATCCCGGCC CGCGACCTGG 1740 
CGCTGGAAGA CGGCGAACGG GTCGAGGTGC TCGGCCTGGT CTTCGAGATC TTCCACGTGC 1800 
1810 1820 1830 1840 1850 1860 

i i i i 1 i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i l i i i i I 

CCGGCCATAC CCTCGGCCAT ATCGCCTACT ACCACCCGGC GGAGACGCCG CTGCTGTTCT 1860 
GCGGCGACAC CCTGTTCGCC GCCGGCTGCG GCCGTCTCTT CGAAGGCACC CCGGCGCAGA 1920 
TGCACCATTC CCTGGCGCGA CTGGCCGCGC TGCCGGCCAA CACCCGGGTC TACTGCACCC 1980 
ACGAGTACAC GCTGAGCAAC CTGCGCTTCG CGCTGGCGGT GGAGCCCGAC AACGCGGCGC 2040 
TGCGGGAA 2048 

Fig. 32C 



33C7 ORF A 

ATGAACGAAC CGCAAGCCTT CGCCCAGACC GATGCCGAGT 40 
GGCTGGCATC GATCAACCGG GCGCGCGACT GGTTCCAGGG 80 
CCCGCTGGGC- AGCCTGATGC -TGGCCGAGGA GCGACGCCTG 120 
TTGTGCGACG AGCTGACCCG TTACTTCGGT GGCTACCTGG 160 
TGCACTACGG GCCGCATGCC GAACTGCCGC CGAGCACCGG 200 



GCAGATTCAG CGCGGCGTGC GCCTGGGGCC GCCGCTGCCG 240 
GGCGTGGACA TCGCCTGCGA AGAGGGCGCC TGGCCGCTCA 280 
GCGAACATGC CGCGGACGTG GTCCTGCTGC AACACGGCCT 320 
GGATTTCTGC CTGTCGCCTC ACCGTCTCCT GCGCGAAGCC 360 
GCGCGTACCG TTCGTCCGGG CGGCCACCTG CTGCTGATCG 400 



GCATCAACCC ATGGAGCCTG TGGGGCATCC GTCATTATTT 440 
CGCCGGGGAT GCCTTGCGCC AGGCCCGCTG CATTCCTCCG 480 
TCGCGGGCCT GCGATTGGCT CAACCTGCTG GGCTTCGCGC 520 
TGGAGAAACG GCGCTTCGGG TGCTATCGTC CGCCGCTTGC 560 
GTCGGCAGCC TGGCAATCGC GCCTGGCTCG CCTGGAGCGC 600 



TGGGGCGACG CCTGGCAGTC TTCGGGCGCC GGCTTCTATC 640 
TATTGGTGGC ACGCAAGCTG GTCGTGGGGT TGCGCCCGTT 680 
GCGCCAGAGC AAGCGCGAAC CGCGCGGTCA GCTGGTGCCC 720 
ATGCCGGTGG CGAAAGTCAG CCGGCGAGAT TCCGAAATTT 760 
AG 762 

Fig. 32D 
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Sequence: 33C7 ORF A From: 1 To: 254 

10 20 30 40 

i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I 

MNEPQAFAQT DAEWLASINR ARDWFQGPLG SLMLAEERRL 40 
LCDELTRYFG GYLVHYGPHA ELPPSTGQIQ RGVRLGPPLP 80 
GVDIACEEGA WPLSEHAADV VLLQHGLDFC LSPHRLLREA 120 
ARTVRPGGHL LLIGINPWSL WGIRHYFAGD ALRQARCIPP 160 
SRACDWLNLL GFALEKRRFG CYRPPLASAA WQSRLARLER 200 

WGDAWQSSGA GFYLLVARKL WGLRPLRQS KREPRGQLVP 240 
MPVAKVSRRD SEI. 254 

Fig. 32E 

Sequence: 33C7 ORF B From: 1 To: 801 

, 10 , 20 30 40 50 60 

'' ■' I t'l l ! I I I I I I I I I I l l l l I I i i i 1 i i i i I ii i i I i i i i I i i i i I i i i i I i i i i 1 

ATGGAGCCTG TGGGGCATCC GTCATTATTT CGCCGGGGAT GCCTTGCGCC AGGCCCGCTG 60 
CATTCCTCCG TCGCGGGCCT GCGATTGGCT CAACCTGCTG GGCTTCGCGC TGGAGAAACG 120 
GCGCTTCGGG TGCTATCGTC CGCCGCTTGC GTCGGCAGCC TGGCAATCGC GCCTGGCTCG 180 
CCTGGAGCGC TGGGGCGACG CCTGGCAGTC TTCGGGCGCC GGCTTCTATC TATTGGTGGC 240 
ACGCAAGCTG GTCGTGGGGT TGCGCCCGTT GCGCCAGAGC AAGCGCGAAC CGCGCGGTCA 300 

, 310 320 330 340 350 360 

' ' i ' 1 ' i ' i I i i i 1 1 i 1 1 i I i i i 1 1 i 1 1 1 I i i i 1 1 1 i 1 1 I i i i i 1 1 i 1 1 1 1 1 i 1 1 1 1 i i I 

GCTGGTGCCC ATGCCGGTGG CGAAAGTCAG CCGGCGAGAT TCCGAAATTT AGGCATGACA 360 
GATAAAGAAC AGGTAGTGAT CTATACCGAC GGCGCCTGCA AGGGCAACCC TGGGCGCGGC 420 
GGCTGGGGGG CGTTGCTCCT CTACAAGGGC GCCGAGCGAG AGCTTTGGGG CGGCGAGCCG 480 
GACACCACCA ACAACCGCAT GGAGCTGATG GCGGCGATCC AGGCGCTGGC GGCACTCAAG 540 
CGTTCCTGTC CGATCCGTCT GATCACCGAC TCGGAATACG TGATGCGCGG CATCACCGAA 600 

610 620 630 640 650 660 

' ' i 1 1 1 i ii I 1 1 i 1 1 1 i 1 1 I i i i 1 1 1 i 1 1 I i 1 1 i 1 1 1 M I 1 1 i i 1 1 i i i I i 1 1 1 1 1 i i i l 

TGGTTGCCGA ACTGGAAGAA GCGCGGCTGG AAGACCGCCA GCAAGCAGCC TGTCAAGAAT 660 
GCCGACCTCT GGCAGGCCCT GGATGAACAG GTCGCCCGGC ACCAGGTGGA GTGGCAGTGG 720 
GTCCGCGGGC ATACCGGCGA CCCCGGCAAC GAGCGGGCCG ACCAGTTGGC CAACCGTGGC 780 
GTCGCCGAAT TGCCGCGCTG A 801 

Fig. 32F 

Sequence: 33C7 ORF B PROTEIN From: 1 To: 267 

10 20 30 40 50 

i H i I i i i i I i i i i 1 i ii i I i ii i i i i i i I i i i i I i i i i I i i i i i i i i i 1 

MEPVGHPSLF RRGCLAPGPL HSSVAGLRLA QPAGLRAGET ALRVLSSAAC 50 
VGSLAIAPGS PGALGRRLAV FGRRLLSIGG TQAGRGVAPV APEQARTARS 100 
AGAHAGGESQ PARFRNLGMT DKEQWIYTD GACKGNPGRG GWGALLLYKG 150 
AERELWGGEP DTTNNRMELM AAIQALAALK RSCPIRLITD SEYVMRGITE 200 
WLPNWKKRGW KTASKQPVKN ADLWQALDEQ VARHQVEWQW VRGHTGDPGN 250 

260 270 280 290 300 
i i i i 1 i i ii I i i i i I i i i i 1 ii i i I i i i i i i i i i I i i i i 1 i i i i I i i i i I 

ERADQLANRG VAELPR. 267 

Fig. 32G 
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33C7 ORF C 

10 20 30 40 50 60 



ATGACGGAT6 CCCCACAGGC TCCATGGGTT GATGCCGATC AGCAGCAGGT GGCCGCCCGG 60 
ACGAACGGTA CGCGCGGCTT CGCGCAGGAG ACGGTGAGGC GACAGGCAGA AATCCAGGCC 120 
GTGTTGCAGC AGGACCACGT CCGCGGCATG TTCGCTGAGC GGCCAGGCGC CCTCTTCGCA 180 
GGCGATGTCC ACGCCCGGCA GCGGCGGCCC CAGGCGCACG CCGCGCTGAA TCTGCCCGGT 240 
GCTCGGCGGC AGTTCGGCAT GCGGCCCGTA GTGCACCAGG TAGCCACCGA AGTAACGGGT 300 
310 320 330 340 350 360 

M I I 1 I I I I I I I I I I I [ I I I I I I I I l l l I I l l l l I l l i l I l l l l I l l l I I l l l l I l i i i I 

CAGCTCGTCG CACAACAGGC GTCGCTCCTC GGCCAGCATC AGGCTGCCCA GCGGGCCCTG 360 
GAACCAGTCG CGCGCCCGGT TGATCGATGC CAGCCACTCG GCATCGGTCT GGGCGAAGGC 420 
TTGCGGTTCG TTCATGCGTA CCTCCAGCGT CTTCCCCTTC GCGGCGACGG ACGCCGGCAC 480 
GACGGGAAAA TAAGCAATAC TATGCGCCAA TGA 513 

Fig. 32H 



Sequence: 33C7 ORF C PROTEIN From: 1 To: 171 

10 20 30 40 50 

i i i [ 1 i i i i I i i i i I ii i i 1 i i i i I i i i i I i i i i I i i i i I i i i i I i i i i I 

MTDAPQAPWV DADQQQVAAR TNGTRGFAQE TVRRQAEIQA VLQQDHVRGM 50 
FAERPGALFA GDVHARQRRP QAHAALNLPG ARRQFGMRPV VHQVATEVTG 100 
QLVAQQASLL GQHQAAQRAL EPVARPVDRC QPLGIGLGEG LRFVHAYLQR 150 
LPLRGDGRRH DGKISNTMRQ . 171 



Fig. 321 



128/133 



1G2 SEQ ID NO:137 

1 NTTGTGTTAA GATCAGGCTT GGTGGTGAAG AAAGGTTCGA ACNNGTGGTC 

51 AATGATCNAC TTCGGGGATN CNGCTGCCCG TATNATTCAA CACGTGGTCA 

101 AACGGTATGT TCCGAGGCGT CTGNCCACCN GTACTAGTCG ACGC 



Fig. 33 
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Fig. 34A 
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* PA14(pUCP18) 

<> phol5(pEcdsbA) 

phol5(pPAdsbA) 
. . . -o- . - - phol5(pUCP18) 



.-Cr 



20 



30 



40 50 60 

TIME AFTER EXPOSURE (HOUR) 



Fig. 34B 
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Fig. 34C 
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Fig. 34D 
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PA14 3E8 AphnAB 

Fig. 35C 



